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Description 

FIELD OF THE INVENTION 

5 [0001] The present invention generally relates to the field of genetic engineering and more particularly to growth 
factors for endothelial cells and growth factor genes. 

BACKGROUND OF THE INVENTION 

w [0002] Developmental growth, the remodelling and regeneration of adult tissues, as well as solid tumor growth, can 
only occur when accompanied by blood vessel formation. Angioblasts and hematopoietic precursor cells differentiate 
from the mesoderm and form the blood islands of the yolk sac and the primary vascular system of the embryo. The 
development of blood vessels from these early (in situ) differentiating endothelial cells is termed vasculogenesis. Major 
embryonic blood vessels are believed to arise via vasculogenesis, whereas the formation of the rest of the vascular 

15 tree is thought to occur as a result of vascular sprouting from pre-existing vessels, a process called angiogenesis, 
Risau etai., Devel. Biol., 725:441-450 (1988). 

[0003] Endothelial cells give rise to several types of functionally and morphologically distinct vessels. When organs 
differentiate and begin to perform their specific functions, the phenotypic heterogeneity of endothelial cells increases. 
Upon angiogenic stimulation, endothelial cells may re-enter the cell cycle, migrate, withdraw from the cell cycle and 

20 subsequently differentiate again to form new vessels that are functionally adapted to their tissue environment. En- 
dothelial cells undergoing angiogenesis degrade the underlying basement membrane and migrate, forming capillary 
sprouts that project into the perivascular stroma. Ausprunk et al., Microvasc. Rev., 74:51-65 (1977). Angiogenesis 
during tissue development and regeneration depends on the tightly controlled processes of endothelial cell proliferation, 
migration, differentiation, and survival. Dysfunction of the endothelial cell regulatory system is a key feature of many 

25 diseases. Most significantly, tumor growth and metastasis have been shown to be angiogenesis dependent. Folkman 
etai, J. Bioi. Chem., 267:10931-10934 (1992). 

[0004] Key signals regulating cell growth and differentiation are mediated by polypeptide growth factors and their 
transmembrane receptors, many of which are tyrosine kinases. Autophosphorylated peptides within the tyrosine kinase 
insert and carboxyl-terminal sequences of activated receptors are commonly recognized by kinase substrates involved 
30 in signal transduction for the readjustment of gene expression in responding cells. Several families of receptor tyrosine 
kinases have been characterized. Van der Geer etai., Ann. Rev. CeiiBioi., 70:251-337 (1994). The major growth factors 
and receptors transducing angiogenic stimuli are schematically shown in Fig. 1 . 

[0005] Fibroblast growth factors are also known to be involved in the regulation of angiogenesis. They have been 
shown to be mitogenic and chemotactic for cultured endothelial cells. Fibroblast growth factors also stimulate the 

35 production of proteases, such as collagenases and plasminogen activators, and induce tube formation by endothelial 
cells. Saksela et al., Ann. Rev. Cell Biol., 4:93-126 (1 988). There are two general classes of fibroblast growth factors, 
FGF-1 and FGF-2, both of which lack conventional signal peptides. Both types have an affinity for heparin and FGF- 
2 is bound to heparin sulfate proteoglycans in the subendothelial extracellular matrix from which it may be released 
after injury. Heparin potentiates the stimulation of endothelial cell proliferation by angiogenic FGFs, both by protecting 

40 against denaturation and degradation and dimerizing the FGFs. Cultured endothelial cells express the FGF-1 receptor 
but no significant levels of other high-affinity fibroblast growth factor receptors. 

[0006] Among other ligands for receptor tyrosine kinases, the platelet derived growth factor, PDGF-BB, has been 
shown to be weakly angiogenic in the chick chorioallantoic membrane. Risau etai, Growth Factors, 7:261-266 (1992). 
Transforming growth factor a (TGFa) is an angiogenic factor secreted by several tumor cell types and by macrophages. 

45 Hepatocyte growth factor (HGF), the ligand of the c-met proto-oncogene-encoded receptor, also is strongly angiogenic. 
[0007] Recent evidence shows that there are endothelial cell specific growth factors and receptors that may be 
primarily responsible for the stimulation of endothelial cell growth, differentiation and certain differentiated functions. 
The best studied of these is vascular endothelial growth factor (VEGF), a member of the PDGF family. Vascular en- 
dothelial growth factor is a dimeric glycoprotein of disulfide-linked 23 kD subunits. Other reported effects of VEGF 

50 include the mobilization of intracellular calcium, the induction of plasminogen activator and plasminogen activator in- 
hibitor-1 synthesis, stimulation of hexose transport in endothelial cells, and promotion of monocyte migration in vitro. 
Four VEGF isoforms, encoded by distinct mRNA splice variants, appear to be equally capable of stimulating mitogen- 
esis in endothelial cells. However, each isoform has a different affinity for cell surface proteoglycans, which behave as 
low affinity receptors for VEGF. The 1 21 and 1 65 amino acid isoforms of VEGF (VEGF1 21 and VEGF1 65) are secreted 

55 in a soluble form, whereas the isoforms of 1 89 and 206 amino acid residues remain cell surface-associated and have 
a strong affinity for heparin. VEGF was originally purified from several sources on the basis of its mitogenic activity 
toward endothelial cells, and also by its ability to induce microvascular permeability, hence it is also called vascular 
permeability factor (VPF). 
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[0008] The pattern of VEGF expression suggests its involvement in the development and maintenance of the normal 
vascular system and in tumor angiogenesis. During murine development, the entire 7.5 day post-coital (p.c.) endoderm 
expresses VEGF and the ventricular neuroectoderm produces VEGF at the capillary ingrowth stage. See Breier etai, 
Development, 1 74:521 -523 (1 992). On day two of quail development, the vascularized area of the yolk sac as well as 
the whole embryo show expression of VEGF. In addition, epithelial cells next to fenestrated endothelia in adult mice 
show persistent VEGF expression, suggesting a role in the maintenance of this specific endothelial phenotype and 
function. 

[0009] Two high affinity receptors for VEGF have been characterized. These are VEGFR-1/FIM (fms-like tyrosine 
kinase-1) and VEGFR-2/Kdr/Flk-1 (kinase insert domain containing receptor/fetal liver kinase-1). Those receptors are 
classified in the PDGF-receptor family, but they have seven rather than five immunoglobulin-like loops in their extra- 
cellular domain (see Fig. 1 ) and they possess a longer kinase insert than normally observed in this family. The expres- 
sion of VEGF receptors occurs mainly in vascular endothelial cells, although some may be present on hematopoietic 
progenitor cells, monocytes, and melanoma cells. Only endothelial cells have been reported to proliferate in response 
to VEGF, and endothelial cells from different sources show different responses. Thus, the signals mediated through 
VEGFR-1 and VEGFR-2 appear to be cell type specific. The VEGF-related placenta growth factor (PIGF) was recently 
shown to bind to VEGFR-1 with high affinity. PIGF was able to enhance the growth factor activity of VEGF, but it did 
not stimulate endothelial cells on its own. Naturally occurring VEGF/PIGF heterodimers were nearly as potent mitogens 
as VEGF homodimers for endothelial cells. Cao etai, J. Biol. Chem., 277:3154-62 (1996). 

[0010] The Flt4 receptor tyrosine kinase (VEGFR-3) is closely related in structure to the products of the VEGFR-1 
and VEGFR-2 genes. Despite this similarity, the mature form of Flt4 differs from the VEGF receptors in that it is pro- 
teolytically cleaved in the extracellular domain into two disulfide-linked polypeptides. Pajusola etal., Cancer Res., 52: 
5738-5743 (1 992). The 4.5 and 5.8 kb Flt4 mRNAs encode polypeptides which differ in their C-termini due to the use 
of alternative 3' exons. Isoforms of VEGF or PIGF do not show specific binding to Flt4 or induce its autophosphorylation. 
[001 1] Expression of Flt4 appears to be more restricted than the expression of VEGFR-1 or VEGFR-2. The expres- 
sion of Flt4 first becomes detectable by in situ hybridization in the angioblasts of head mesenchyme, the cardinal vein, 
and extraembryonically in the allantois of 8.5 day p.c. mouse embryos. In 12.5 day p.c. embryos, the Flt4 signal is 
observed in developing venous and presumptive lymphatic endothelia, but arterial endothelia appear negative. During 
later stages of development, Flt4 mRNA becomes restricted to developing lymphatic vessels. The lymphatic endothelia 
and some high endothelial venules express Flt4 mRNA in adult human tissues and increased expression occurs in 
lymphatic sinuses in metastatic lymph nodes and in lymphangioma. These results support the theory of the venous 
origin of lymphatic vessels. 

[0012] Five endothelial cell specific receptor tyrosine kinases, Flt-1 (VEGFR-1), KDR/Flk-1 (VEGFR-2), Flt4 (VEGFR- 
3), Tie, and Tek/Tie-2 have so far been described, which possess the intrinsic tyrosine kinase activity essential for 
signal transduction. Targeted mutations inactivating Flt-1, Flk-1, Tie, and Tek in mouse embryos have indicated their 
essential and specific roles in vasculogenesis and angiogenesis at the molecular level. VEGFR-1 and VEGFR-2 bind 
VEGF with high affinity (K d 16 pM and 760 pM, respectively) and VEGFR-1 also binds the related placenta growth 
factor (PIGF; K d about 200 pM). A ligand for Tek is reported in PCT patent publication WO 96/11269. 

SUMMARY OF THE INVENTION 

[0013] The present invention provides a ligand, designated VEGF-C, for the Flt4 receptor tyrosine kinase (VEGFR- 
3). Thus, the invention provides a purified and isolated polypeptide which is capable of binding to the Flt4 receptor 
tyrosine kinase. In one embodiment, the invention provides a purified and isolated polypeptide that is capable of binding 
to Flt4 receptor tyrosine kinase (Flt4), with an amino acid sequence comprising a portion of amino acid sequence 
103-419 of SEQ ID NO:33, wherein that portion is capable of binding to Flt4 and the polypeptide does not include an 
amino acid sequence corresponding to amino acids 94-41 9 of SEQ ID NO:33 in which a Gin residue has been substi- 
tuted for the Lys residue at position 414 in SEQ ID NO:33. 

[001 4] Preferably, the polypeptide has a portion of the amino acid sequence encloded by the coding insert of plasmid 
pFLT4-L (deposited under ATCC accession number 97231). 

[0015] In another embodiment, an FLT4 ligand polypeptide has a molecular weight of approximately 23kD as deter- 
mined by SDA-PAGE under reducing conditions and which is purifyable from conditioned media from a PC-3 prostatic 
adenocarcinoma cell line, the cell line having ATCC Acc. No. CRL 1435. Amino acid sequencing of this PC-3 cell 
derived ligand polypeptide revealed that the ligand polypeptide comprises an amino terminal amino acid sequence set 
forth in SEQ I D NO: 1 3. A conditioned medium comprising an Flt4 ligand is itself an aspect of the invention. The present 
invention also provides a new use for the PC-3 prostatic adenocarcinoma cell line which produces an Flt4 ligand. In a 
preferred embodiment, the ligand may be purified and isolated directly from the PC-3 cell culture medium. 
[0016] In a highly preferred embodiment, the ligand polypeptide comprises a fragment of the amino acid sequence 
shown in SEQ ID NO: 33 which specifically binds to the human Flt4 receptor tyrosine kinase. Exemplary fragments 
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include: a polypeptide comprising an amino acid sequence set forth in SEQ ID NO: 33 from residue 112 to residue 
213; a polypeptide comprising an amino acid sequence from residue 104 to residue 227 of SEQ ID NO: 33; and a 
polypeptide comprising an amino acid sequence from residue 112 to residue 227 of SEQ ID NO: 33. Other exemplary 
fragments include polypeptides comprising amino acid sequences of SEQ ID NO: 33 that span, the following residues: 
5 31-213,31-227,32-227, 103-217, 103-225, 104-213, 113-213, 103-227, 113-227, 131-211, 161-211, 103-225,31-419, 
as described in greater detail below. 

[0017] A putative VEGF-C precursor or splice variant, consisting of polypeptides with molecular weights of about 29 
and 32 kD, also is considered an aspect of the invention. 

[0018] In another embodiment, an expressed Flt4 ligand polypeptide precursor is proteolytically cleaved upon ex- 
10 pression to produce an approximately 21 kD VEGF-C polypeptide. Sequence analysis has indicated that an observed 
21 kD form has an amino terminus approximately 9 amino acids downstream from the amino terminus of the 23 kD 
form, suggesting that alternative cleavage sites exist. 

[0019] From the foregoing, it will be apparent that an aspect of the invention includes a fragment of the purified and 
isolated polypeptide having the amino acid sequence of residues 1 to 41 9 shown in SEQ ID NO: 33, the fragment being 
15 capable of specifically binding to Flt4 receptor tyrosine kinase. Preferred embodiments include fragments having an 
apparent molecular weight of approximately 21/23 kD and 29/32 kD as assessed by SDS-PAGE under reducing con- 
ditions. 

[0020] Evidence suggests that the amino acids essential for retaining Flt4 ligand activity are contained within ap- 
proximately amino acids 1 03/112-226/227 of SEQ ID NO: 33, and that a carboxy-terminal proteolytic cleavage to pro- 

20 duce a mature, naturally-occurring, Flt4 ligand occurs at the approximate position of amino acids 226-227 of SEQ ID 
NO: 33. Accordingly, a preferred Flt4 ligand comprises approximately amino acids 103-227 of SEQ ID NO: 33. 
[0021] VEGF-C mutational analysis described herein indicates that a naturally occurring VEGF-C polypeptide span- 
ning amino acids 103-227 of SEQ ID NO: 33, produced by a natural processing cleavage that defines the C-terminus, 
exists and is biologically active as an Flt4 ligand. A polypeptide fragment consisting of residues 104-213 of SEQ ID 

25 NO: 33 has been shown to retain VEGF-C biological activity. Additional mutational analyses indicate that a polypeptide 
spanning only amino acids 113-213 of SEQ ID NO: 33 retains Flt4 ligand activity. Accordingly, preferred polypeptides 
comprise sequences spanning, approximately, amino acid residues 103-227, 104-213, or 113-213, of SEQ ID NO: 33. 
[0022] Moreover, sequence comparisons of members of the VEGF family of polypeptides provide an indication that 
still smaller fragments will retain biological activity, and such smaller fragments are intended as aspects of the invention. 

30 in particular, eight highly conserved cysteine residues of the VEGF family of polypeptides define a region from residue 
131 to residue 211 of SEQ ID NO: 33 (see Figures 10 & 31 ); therefore, a polypeptide spanning from about residue 131 
to about residue 211 is expected to retain VEGF-C biological activity. In fact, a polypeptide comprising approximately 
residues 161-211, which retains an evolutionarily-conserved RCXXCC motif, is postulated to retain VEGF-C activity, 
and therefore is intended as an aspect of the invention. Some of the conserved cysteine residues in VEGF-C participate 

35 in interchain disulfide bonding to make homo- and heterodimers of the various naturally occurring VEGF-C polypep- 
tides. Beyond the preceding considerations, evidence exists that VEGF-C polypeptides lacking interchain disulfide 
bonds retain VEGF-C biological activity. 

[0023] The invention also includes multimers (including dimers) comprising such fragments linked to each other or 
to other polypeptides. Fragment linkage may be by way of covalent bonding (e.g., disulfide bonding) or non-covalent 

40 bonding of polypeptide chains (e.g, hydrogen bonding, bonding due to stable or induced dipole-dipole interactions, 
bonding due to hydrophobic or hydrophilic interactions, combinations of these bonding mechanisms, and the like). 
[0024] The present invention also provides purified and isolated polynucleotides (i.e., nucleic acids) encoding 
polypeptides of the invention, for example a cDNA or corresponding genomic DNA encoding a VEGF-C precursor, 
VEGF-C, or biologically active fragments or variants thereof, wherein the nucleic acid molecule is not a cDNA molecule 

45 with the DNA sequence set out in entry HS991157, accession number H07991 .1 in the EMBL database. 

[0025] A preferred polynucleotide according to the invention comprises the human VEGF-C cDNA sequence set 
forth in SEQ ID NO: 32 from nucleotide 352 to 1608. 

[0026] In a further embodiment of the invention, a nucleic acid molecule is provided, comprising or consisting of a 
VEGF-C encoding insert of plasmid pFLT4-L, deposited under accession number 97231 . 

50 [0027] Additional aspects of the invention include vectors which comprise nucleic acids of the invention; and host 
cells transformed or transfected with nucleic acids or vectors of the invention. Preferred vectors of the invention are 
expression vectors wherein nucleic acids of the invention are operatively connected to appropriate promoters and other 
control sequences, such that appropriate host cells transformed or transfected with the vectors are capable of express- 
ing the Flt4 ligand. A preferred vector of the invention is plasmid pFLT4-L, having ATCC accession no. 97231. Such 

55 vectors and host cells are useful for recombinantly producing VEGF-C polypeptides. 

[0028] The invention further includes a method of making polypeptides of the invention. In a preferred method, a 
nucleic acid or vector of the invention is expressed in a host cell, and a polypeptide of the invention is purified from the 
host cell or the host cell's growth medium. 
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[0029] In a related embodiment, the invention includes a method of making a polypeptide capable of specifically 
binding to Flt4 receptor tyrosine kinase, comprising the steps of: (a) transforming or transfecting a host cell with a 
nucleic acid of the invention; (b) cultivating the host cell to express the nucleic acid; and (c) purifying a polypeptide 
capable of specifically binding to Flt4 receptor tyrosine kinase from the host cell or from the host cell's growth media. 
[0030] The invention also is intended to include purified and isolated polypeptide ligands of Flt4 produced by methods 
of the invention. In one preferred embodiment, the invention includes a human VEGF-C polypeptide or biologically 
active fragment thereof that is substantially free of other human polypeptides. 

[0031] In another aspect, the invention includes an antibody which is specifically reactive with polypeptides of the 
invention, or with polypeptides multimers of the invention. Antibodies, both monoclonal and polyclonal, may be made 
against a polypeptide of the invention according to standard techniques in the art. Such antibodies may be used in 
diagnostic applications to monitor angiogenesis, vascularization, lymphatic vessels and their disease states, wound 
healing, or certain tumor cells, hematopoietic, or leukemia cells. The antibodies also may be used to block the ligand 
from activating the Flt4 receptor. 

[0032] Ligands according to the invention may be labeled with a detectable label and used to identify their corre- 
sponding receptors in situ. Labeled Flt4 ligand and anti-Flt4 ligand antibodies may be used as imaging agents in the 
detection of lymphatic vessels, high endothelial venules and their disease states, and Flt4 receptors expressed in 
histochemical tissue sections. The ligand or antibody may be covalently or non-covalently coupled to a suitable super- 
magnetic, paramagnetic, electron dense, echogenic, or radioactive agent for imaging. Other, non-radioactive labels, 
such as biotin and avidin, may also be used. 

[0033] A related aspect of the invention is a method for the detection of specific cells, e.g., endothelial cells. These 
cells may be found in vivo, or in ex vivo biological tissue samples. The method of detection comprises the steps of 
exposing a biological tissue comprising, e.g., endothelial cells, to a polypeptide according to the invention which is 
capable of binding to Flt4 receptor tyrosine kinase, under conditions wherein the polypeptide binds to the cells, option- 
ally washing the biological tissue, and detecting the polypeptide bound to the cells in the biological tissue, thereby 
detecting the cells. 

[0034] The present invention also provides diagnostic and clinical applications for claimed ligands. In a preferred 
embodiment, Flt4 ligands or precursors are used to accelerate angiogenesis, e.g., during wound healing, or to promote 
the endothelial functions of lymphatic vessels. A utility for VEGF-C is suggested as an inducer of angiogenesis also 
in tissue transplantation, in eye diseases, in the formation of collateral vessels around arterial stenoses and into injured 
tissues after infarction. Polypeptides according to the invention may be administered in any suitable manner using an 
appropriate pharmaceutically-acceptable vehicle, e.g., a pharmaceutically-acceptable diluent, adjuvant, excipient or 
carrier. VEGF-C polypeptides also may be used to quantify future metastatic risk by assaying biopsy material for the 
presence of active receptors or ligands in a binding assay or kit using detectably-labeled VEGF-C. An Flt4 ligand 
according to the invention also may be used to promote re-growth or permeability of lymphatic vessels in, for example, 
organ transplant patients. In addition, an Flt4 ligand may be used to mitigate the loss of axillary lymphatic vessels 
following surgical interventions in the treatment of cancer (e.g., breast cancer). Ligands according to the invention also 
may be used to treat or prevent inflammation, edema, aplasia of the lymphatic vessels, lymphatic obstruction, elephan- 
tiasis, and Milroy's disease. Finally, Flt4 ligands may be used to stimulate lymphocyte production and maturation, and 
to promote or inhibit trafficking of leukocytes between tissues and lymphatic vessels or to affect migration in and out 
of the thymus. 

[0035] The invention includes a method of screening for an endothelial cell disorder in a mammalian subject. The 
method comprises providing a sample of endothelial cells from the subject, contacting the sample of endothelial cells 
with a polypeptide according to claim 4, determining the growth rate of the cells, and correlating the growth rate with 
a disorder. In a preferred embodiment, the endothelial cells are lymphatic endothelial cells. In another preferred em- 
bodiment, the mammalian subject is a human being and the endothelial cells are human cells. In yet another preferred 
embodiment, the disorder is a vessel disorder, e.g., a lymphatic vessel disorder, such as the loss of lymphatic vessels 
through surgery or the reduction in function of existing lymphatic vessels due to blockages. In another embodiment, 
the endothelial cells are contacted with the polypeptide in vitro. The growth rate determined in the method is the rate 
of cell division per unit time, determined by any one of a number of techniques known in the art. The correlation of the 
growth rate with a disorder can involve a positive or negative correlation, e.g., whether the polypeptide has Flt4 ligand 
activity or is an inhibitor of such activity, as described below. 

[0036] Inhibitors of the Flt4 ligand may be used to control endothelial cell proliferation, lymphangiomas, and meta- 
static cancer. For example, such inhibitors may be used to arrest metastatic growth or spread, or to control other 
aspects of endothelial cell expression and growth. Inhibitors include antibodies, antisense oligonucleotides, and 
polypeptides which block the Flt4 receptor, all of which are intended as aspects of the invention. 
[0037] In another embodiment, the invention provides a method for modulating the growth of endothelial cells in a 
mammalian subject comprising the steps of exposing mammalian endothelial cells to a polypeptide according to the 
invention in an amount effective to modulate the growth of the mammalian endothelial cells. In one embodiment, the 
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modulation of growth is effected by using a polypeptide capable of stimulating tyrosine phosphorylation of Flt4 receptor 
tyrosine kinase in a host cell expressing the Flt4 receptor tyrosine kinase. In modulating the growth of endothelial cells, 
the invention contemplates the modulation of endothelial cell-related disorders. Endothelial cell disorders contemplated 
by the invention include, but are not limited to, physical loss of lymphatic vessels (e.g., surgical removal of axillary 

5 lymph tissue), lymphatic vessel occlusion (e.g., elephantiasis), and lymphangiomas. In a preferred embodiment, the 
subject, and endothelial cells, are human. The endothelial cells may be provided in vitro or in vivo, and they may be 
contained in a tissue graft. An effective amount of a polypeptide is defined herein as that amount of polypeptide em- 
pirically determined to be necessary to achieve a reproducible change in cell growth rate (as determined by microscopic 
or macroscopic visualization and estimation of cell doubling time, or nucleic acid synthesis assays), as would be un- 

10 derstood by one of ordinary skill in the art. 

[0038] Purified and isolated polynucleotides encoding other (non-human) VEGF-C forms also are aspects of the 
invention, as are the polypeptides encoded thereby, and antibodies that are specifically immunoreactive with the non- 
human VEGF-C variants. Preferred non-human forms of VEGF-C are forms derived from other vertebrate species, 
including avian and mammalian species. Mammalian forms are highly preferred. Thus, the invention includes a purified 

15 and isolated mammalian VEGF-C polypeptide, and also a purified and isolated polynucleotide encoding such a polypep- 
tide. 

BRIEF DESCRIPTION OF THE DRAWING 
20 [0039] 

Figure 1 schematically depicts major endothelial cell receptor tyrosine kinases and growth factors involved in 
vasculogenesis and angiogenesis. Major structural domains are depicted, including immunoglobulin-like domains 
(IGH), epidermal growth factor homology domains (EGFH), fibronectin type III domains (FN III), transmembrane 
25 (TM) and juxtamembrane (JM) domains, tyrosine kinase (TK1, TK2) domains, kinase insert domains (Kl), and 

carboxy-terminal domains (CT) . 

Figure 2 schematically depicts the construction of the pLTRFIt4l expression vector. 

Figure 3 schematically depicts the construction of the baculovirus vector encoding a secreted soluble Flt4 extra- 
cellular domain (Flt4EC). 

30 Figure 4 shows results of stimulation of Flt4 autophosphorylation by conditioned medium from PC-3 cell cultures. 

Figures 5A, 5B, and 5C show that the major tyrosyl phosphorylated polypeptide of Flt4-transfected cells stimulated 
with PC-3 conditioned medium is the 1 25 kD Flt4 polypeptide (VEGFR-3), and also that the Flt4 stimulating activity 
is not adsorbed to heparin-sepharose. 

Figure 6 shows Western immunoblotting analysis of the Flt4 ligand activity isolated from PC-3 conditioned medium. 
35 Figure 7 shows results of gel electrophoresis of chromatographic fractions from the affinity purification of Flt4 ligand 

(VEGF-C) isolated from concentrated PC-3 conditioned medium. 

Figure 8 depicts the amino acid sequences of human, murine, and quail VEGF-C polypeptides, aligned to show 
similarity. Residues conserved in all three species are depicted in bold. 

Figure 9 schematically depicts the cloning and structure of the Flt4 ligand, VEGF-C. The VEGF homologous region 
40 (dark shaded box) and amino and carboxyl terminal propeptides (light shaded and unshaded boxes, respectively) 

as well as putative signal sequence (ss) are depicted between 5' and 3' untranslated (ut) nucleic acid regions. The 

cleavage sites for the signal sequence and the amino and carboxyl terminal propeptides are indicated with triangles. 

Figure 10 shows a comparison of the deduced amino acid sequences of PDGF-A, -B, PIGF-1, VEGF-B 167 , 

VEGF1 65, and Flt4 ligand (VEGF-C). 
45 Figure 11 depicts the exon-intron organization of the human VEGF-C gene. Seven exons are depicted as open 

boxes, with exon size depicted in base pairs. Introns are depicted as lines, with intron size (base pairs) depicted 

above the lines. 5' and 3' untranslated sequences of a putative 2.4 kb mature mRNA are depicted as shaded boxes. 

The location of genomic clones used to characterize the VEGF-C gene are depicted below the map of the gene. 

Figure 12 shows Northern blotting analysis of the genes encoding VEGF, VEGF-B, AND VEGF-C (indicated by 
50 "FLT4-L") in two human tumor cell lines and in brain tissue. 

Figure 13A is an auto radiograph showing recombinant VEGF-C isolated following a pulse-chase experiment and 

electrophoresed via SDS-PAGE under reducing conditions. 

Figure 13B is a photograph of polyacrylamide gel showing that recombinant VEGF-C forms are disulfide-linked in 
nonreducing conditions. 

55 Figure 14A and 14B depict Western blots showing that VEGF-C stimulates autophosphorylation ofVEGFR-2 (KDR) 

but has no effect on PDGFR-p phosphorylation. 

Figure 15A shows that VEGF-C stimulates endothelial cell migration in a three-dimensional collagen gel assay. 
Figure 16A shows the expression of VEGF-C mRNA in human adult tissues. 
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Figure 16B shows the expression of VEGF, VEGF-B, and VEGF-C in selected human fetal tissues. 
Figure 17 depicts the genomic structure of the human and murine VEGF-C genes. Sequences of exon-intron 
junctions are depicted together with exon and intron lengths. Intron sequences are depicted in lowercase letters. 
Nucleotides of the open reading frame observed in VEGF-C cDNAs are indicated as upper case letters in triplets 

5 (corresponding to the codons encoded at the junctions). 

Figure 18 presents a schematic illustration of VEGF-C processing, including the major forms of VEGF-C. 
Figure 19 depicts autoradiograms from a pulse-chase immunoprecipitation experiment wherein cells transfected 
with a VEGF-C expression vector (VEGF-C) and mock transfected cells (M) were pulse-labeled with radioactive 
amino acids and chased for varying lengths of time. 

10 Figure 20 is a schematic map of the K14-VEGF-C vector construct. 

Figures 21A-C depict electrophoretic fractionations of the various forms of recombinant VEGF-C produced by 
transfected 293 EBNA cells. Figure 21 B depicts the electrophoretic fractionation, under non-reducing conditions, 
of polypeptides produced from mock (M) transfected cells, cells transfected with wild type (wt) VEGF-C cDNA, 
and cells transfected with acDNA variant encoding VEGF-C-R1 02S. Each of the bands identified in Figure 21 B 

15 was excised and electrophoretically fractionated in a separate lane under reducing conditions. Fractionation of 

bands corresponding to wt VEGF-C are depicted in Figure 21 A; fractionation of bands corresponding to the R1 02S 
variant are depicted in Figure 21 C. 

Figures 22A-B depict the forms and sizes of wild type and mutant recombinant VEGF-Cs, as revealed by non- 
reducing gel electrophoresis. Figure 22A shows the VEGF-C forms secreted into the media; Figure 22B shows 
20 the VEGF-C forms retained by the cells. Mock (M) transfected cells served as a control. 

Figures 23A-B present a comparison of the pattern of immunoprecipitated, labelled VEGF-C forms using antisera 
882 and antisera 905. Adjacent lanes contain immunoprecipitates that were (lanes marked +) or were not (lanes 
marked -) subjected to reduction and alkylation. 

Figures 24A-B present Northern blots of total RNA isolated from cells grown in the presence or absence of inter- 
ns leukin-1 (IL-1 ) and/or dexamethasone (DEX) for the indicated times. For Figure 24B, the Northern blot was probed 
with radiolabeled DNA from a VEGF 581 bp cDN A covering bps 57-638 (Genbank Acc. No. X1 5997), and a human 
VEGF-B 167 cDNA fragment (nucleotides 1/382, Genbank Acc. No. U48800). For Figure 24A, the Northern blot 
was probed with radiolabeled DNA from a human full-length VEGF-C cDNA (Genbank Acc. No. X9421 6). 1 8S and 
28S- rRNA markers. 

30 Figure 25 shows VEGF-C expression in P pastoris cultures transfected with a VEGF-C cDNA, with vector alone, 

or mock- (M) transfected, following induction with methanol for various periods of time as indicated. About 10 julI 
of medium was analyzed by gel electrophoresis followed by Western blotting and detection with anti-VEGF-C 
antiserum. 

Figure 26 depicts the results of a Western blot wherein NIH 3T3 cells expressing VEGFR-3 (Flt4), and PAE cells 
35 expressing VEGFR-2 (KDR), were stimulated with 5x concentrated medium from Pichia yeast transfected with a 

VEGF-C cDNA-containing vector (+), with a vector lacking an insert (-), or stimulated with the positive control 
vanadate. The stimulated cells were lysed and immunoprecipitated with VEGFR-specific antibodies, and the im- 
munoprecipitates were blotted and probed with anti-phosphotyrosine antibodies. 

Figures 27A-B present gel electrophoretograms of human VEGF-C (wt) and VEGF-C variants secreted (Figure 
40 27 A) or retained (Figure 27B) by the host 293 EBNA cells. Mock (M) transfected cells served as a control. Molecular 

weight markers are indicated on the left in kilodaltons (kD). 

Figures 28A-B show Western blots of VEGFRs that were stimulated to autophosphorylate by wild type (wt) VEGF-C, 
as well as three VEGF-C polypeptide variants. Cell lysates (NIH 3T3 for VEGFR-3 and PAE for VEGFR-2) were 
subjected to receptor-specific antisera and the receptors were immunoprecipitated. Immunoprecipitates were then 
45 gel-fractionated and blotted for Western analyses. Western blots were probed with anti-phosphotyrosine antibod- 

ies. 

Figures 29A-D are photomicrographs of hematoxylin-eosin stained sections of K1 4-VEGF-C transgenic and control 
mouse littermate tissues. Areas shown are from the dorsal skin and snout, as indicated. The white arrows show 
the endothelium-lined margin of the lacunae devoid of red cells. 

50 Figure 30 presents a Northern blot of polyadenylated RNA from the indicated tissues, hybridized with a pool of 

VEGF, VEGF-B 167 , and VEGF-C probes. Estimated transcript sizes are shown on the right in kilobases (kb). 
Figure 31 presents a comparison of the human and mouse VEGF-C amino acid sequences. The amino acid se- 
quence of mouse VEGF-C is presented on the top line and differences in the human sequence are marked below 
it. The sequences have been labeled to depict the regions shown in Figure 9. The arrow indicates the putative 

55 cleavage site for the signal peptidase; BR3P motifs, as well as a CR/SC motif, are boxed; and conserved cysteine 

residues are marked in bold above the sequence. Arginine residue 158 is also marked in bold. The numbering 
refers to mouse VEGF-C residues. 

Figure 32 presents SDS-PAGE-fractionated samples immunoprecipitated or affinity-purified from various 35 S-la- 
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beled media. In the left panel, control medium from Bosc23 cells containing vector only, medium from cells ex- 
pressing human VEGF-C, and medium from cells expressing mouse VEGF-C were independently precipitated 
with human VEGFR-3-Extracellular Domain coupled to sepharose. In the right panel, similar conditioned media 
were subjected to precipitation with anti-VEGF-C antibodies, mwm: molecular weight markers; m- mouse; h-hu- 
5 man; a- anti. 

Figure 33 shows Western blots of gel-fractionated immunoprecipitates from lysates made from NIH 3T3 cells ex- 
pressing VEGFR-3 or VEGFR-2, as indicated, that had been stimulated by contact with VEGF-C-containing lysates 
(or a vector control), as a measure of VEGF-C-induced receptor autophosphorylation. Western blots were probed 
with anti-phosphotyrosine (oc-PTyr) or anti-receptor antisera (anti-VEGFR-3 and anti-VEGFR-2), as indicated. As 
10 a control, receptor autophosphorylation was induced by pervanadate treatment (V04). The arrows and numbers 

refer to the apparent molecular weights of the tyrosyl phosphorylated receptor polypeptide bands. bVEGF: human 
baculoviral VEGF-C protein; C-FGEVm: lysate from cells harboring a mouse VEGF-C cDNA cloned into the vector 
in an antisense orientation. 

Figures 34A-D depict photomicrographs of in situ hybridizations revealing the expression of VEGF-C and VEGF-B 

15 mRNAs in a parasagittal section of a 12.5 day mouse embryo. Figure 34A: VEGF-C probe; j- jugular veins, mn- 

metanephros, m- mesenterium (arrowheads), vc-intervertebral vessels, lu- lung (arrowheads). Figure 34B: 
VEGF-B probe; h- heart, nasopharyngeal area (arrowheads). Figure 34C: VEGF-C sense strand probe serving as 
a control. Figure 34D: bright-field photomicrograph of the same field shown in Figure 34C. 
Figures 35A-H depict sections of mouse embryos providing comparisons of VEGF-C and VEGFR-3 expression in 

20 the jugular vessels and the mesenteric area. Figures 35A and 35C show expression of VEGF-C transcripts in the 

mesenchyme around the large sac-like structures in the jugular area (arrowheads). Figures 35B and 35D show 
expression of VEGFR-3 transcripts in the jugular venous sacs. Figures 35E and 35G show VEGF-C mRNA distri- 
bution in the mesenteric region of a 14.5 day p.c. embryo, as well as around the gut. Figures 35F and 35H show 
VEGFR-3 mRNA in the mesenteric region of a 14.5 day embryo, as well as the gut area, developing lymphatic 

25 vessels, and venules. 

Figures 36A-D depict photomicrographs showing FLT4 and VEGF-C in situ hybridization of the cephalic region of 
a 16-day p.c. mouse embryo. A section of the cephalic region hybridized with the Flt4 probe (Figure 36A) shows 
the developing snout, nasal structures and eyes. A more caudally located section shows hybridization with the 
VEGF-C probe (Figure 36B). The round structures on both sides in the upper part represent the developing molars. 

30 in the upper (dorsal) part on both sides of the midline, the caudal portion of the developing conchae are seen. 

These structures also are shown in higher magnification darkfield (Figure 36C) and lightfield (Figure 36D) micro- 
scopy. 

DETAILED DESCRIPTION OF THE INVENTION 

35 

[0040] Described herein is the isolation of a novel vascular endothelial growth factor and the cloning of a DNA en- 
coding this novel growth factor from acDNA library prepared from the human prostatic adenocarcinoma cell line PC- 
3. The isolated cDNA encodes a protein which is proteolytically processed and secreted to cell culture medium. The 
secreted protein, designated VEGF-C, binds to the extracellular domain of Flt4 and induces tyrosine autophosphor- 

40 ylation of Flt4 and VEGFR-2. VEGF-C also stimulates the migration of endothelial cells in collagen gel. 

[0041] Data reported herein indicates that VEGF-C is expressed as a larger precursor which is cleaved to produce 
the ligand. A coexpressed region in some cases results from alternative splicing of RNA of the ligand gene. Such a 
co-expressed region may be a function of the particular expression system used to obtain the ligand. The skilled artisan 
understands that in recombinant production of proteins, additional sequence may be expressed along with a functional 

45 polypeptide depending upon the particular recombinant construct used to express the protein, and subsequently re- 
moved to obtain the desired ligand. In some cases the recombinant ligand can be made lacking certain residues of 
the endogenous/natural ligand. Moreover, it is well-known in that conservative replacements may be made in a protein 
which do not alter the function of the protein. 

[0042] Moreover, it is anticipated that one or more VEGF-C precursors (the largest putative native secreted VEGF-C 
50 precursor having the complete amino acid sequence from residue 32 to residue 41 9 of SEQ ID NO: 33) is capable of 
stimulating the Flt4 ligand without any further processing, in a manner similar to that in which VEGF stimulates its 
receptor in its unprocessed form after the secretion and concomitant release of the signal sequence. 
[0043] Results reported herein show that Flt4 (VEGFR-3) transmits signals for VEGF-C. This conclusion is based 
on the specific binding of VEGF-C to recombinant Flt4EC (Flt4 extracellular domain) protein and the induction of VEG- 
55 FR-3 autophosphorylation by medium from VEGF-C transfected cells. In contrast, neither VEGF nor PIGF showed 
specific binding to VEGFR-3 or induced its autophosphorylation. 

[0044] As set forth in greater detail below, the putative prepro-VEGF-C has a deduced molecular mass of 46,883; a 
putative prepro-VEGF-C processing intermediate has an observed molecular weight of about 32 kD; and mature 
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VEGF-C isolated form conditioned media has a molecular weight of about 23 kD as assessed by SDS-PAGE under 
reducing conditions. A major part of the difference in the observed molecular mass of the purified and recombinant 
VEGF-C and the deduced molecular mass of the prepro-VEGF-C encoded by the VEGF-C open reading frame (ORF) 
is attributable to proteolytic removal of sequences at the amino-terminal and carboxyl-terminal regions of the pre- 

5 pro-VEGF-C polypeptide. However, proteolytic cleavage of the putative 1 02 amino acid leader sequence is not believed 
to account for the entire difference between the deduced molecular mass of 46,883 and the observed mass of about 
23 kD, because the deduced molecular weight of a polypeptide consisting of amino acids 1 03-41 9 of SEQ ID NO: 33 
is 35,881 kD. Evidence indicates that a portion of the observed difference in molecular weights is attributable to pro- 
teolytic removal of amino acid residues in both the amino and carboxyl terminal regions of the VEGF-C precursor. 

10 Extrapolation from studies of the structure of PDGF (Heldin et al., Growth Factors, 8:245-52 (1 993)) suggests that the 
region critical for receptor binding and activation by VEGF-C is contained within amino acids residues 104-213, which 
are found in the secreted form of the VEGF-C protein (i.e., the form lacking the putative prepro leader sequence and 
some carboxyterminal sequences). The 23 kD polypeptide binding VEGFR-3 is likely to represent the VEGF-homolo- 
gous domain. After biosynthesis, the nascent VEGF-C polypeptide may be glycosylated at three putative N-linked 

15 glycosylation sites identified in the deduced VEGF-C amino acid sequence. Polypeptides containing modifications, 
such as N-linked glycosylates, are intended as aspects of the invention. 

[0045] The carboxyl terminal amino acid sequences, which increase the length of the VEGF-C polypeptide in com- 
parison with other ligands of this family, show a pattern of spacing of cysteine residues reminiscent of the Balbiani ring 
3 protein (BR3P) sequence (Dignam etal., Gene, 88:133-40 (1990); Paulsson etaL, J. Mol. Biol., 21 7:331 -49 (1990)). 

20 This novel C-terminal silk protein-like structural motif of VEGF-C may fold into an independent domain, which, on the 
basis of the considerations above, is at least partially cleaved off after biosynthesis. Interestingly, at least one cysteine 
motif of the BR3P type is also found in the carboxyl terminus of VEGF. In our experiments both the putative precursor 
and cleaved ligand were detected in the cell culture media, suggesting cleavage by cellular proteases. The determi- 
nation of amino-terminal and carboxy-terminal sequences of VEGF-C isolates allows the identification of the proteolytic 

25 processing sites. The generation of antibodies against different parts of the pro-VEGF-C molecule allows the exact 
determination of the precursor-product relationship and ratio, their cellular distribution, and the kinetics of processing 
and secretion. 

[0046] VEGF-C has a conserved pattern of eight cysteine residues, which may participate in the formation of intra- 
and interchain disulfide bonds, creating an antiparallel dimeric biologically active molecule, similar to PDGF. Mutational 

30 analysis of the cysteine residues involved in the interchain disulfide bridges has shown that, in contrast to PDGF, VEGF 
dimers need to be held together by these covalent interactions in order to maintain biological activity. Disulfide linking 
of the VEGF-C polypeptide chains was evident in the analysis of VEGF-C in nonreducing conditions, although recom- 
binant protein also contained ligand-active VEGF-C forms which lacked disulfide bonds between the polypeptides. 
[0047] VEGFR-3, which distinguishes between VEGF and VEGF-C, is closely related in structure to VEGFR-1 and 

35 VEGFR-2. Finnerty et ai., Oncogene, 8:2293-98 (1993); Galland et al, Oncogene, 8:1233-40 (1993); Pajusola etal., 
Cancer Res., 52:5738-43 (1992). Besides VEGFR-3, VEGFR-2 tyrosine kinase also is activated in response to 
VEGF-C. VEGFR-2 mediated signals cause striking changes in the morphology, actin reorganization and membrane 
ruffling of porcine aortic endothelial cells overexpressing this receptor. In these cells, VEGFR-2 also mediated ligand- 
induced chemotaxis and mitogenicity. Waltenberger et al., J. Biol. Chem., 269:26988-95 (1994). Similarly, the receptor 

40 chimera CSF-1 R/VEGFR-3 was mitogenic when ectopically expressed in NIH 3T3 fibroblastic cells, but not in porcine 
aortic endothelial cells (Pajusola etal., 1 994). Consistent with such results, the bovine capillary endothelial (BCE) cells, 
which express VEGFR-2 mRNA but very little or no VEGFR-1 or VEGFR-3 mRNAs, showed enhanced migration when 
stimulated with VEGF-C. Light microscopy of the BCE cell cultures in collagen gel also suggested that VEGF-C stim- 
ulated the proliferation of these cells. The data thus indicate that the VEGF ligands and receptors show a great specificity 

45 in their signalling, which may be cell-type-dependent. 

[0048] The expression pattern of the VEGFR-3 (Kaipainen etal., Proc. Natl. Acad. Sci. (USA), 92:3566-70 (1995)) 
suggests that VEGF-C may function in the formation of the venous and lymphatic vascular systems during embryo- 
genesis. Constitutive expression of VEGF-C in adult tissues shown herein further suggests that this gene product also 
is involved in the maintenance of the differentiated functions of the lymphatic and certain venous endothelia where 

50 VEGFR-3 is expressed (Kaipainen et al., 1 995). Lymphatic capillaries do not have well-formed basal laminae and an 
interesting possibility remains that the silk-like BR3P motif is involved in producing a supramolecular structure which 
could regulate the availability of VEGF-C in tissues. However, as shown here, VEGF-C also activates VEGFR-2, which 
is abundant in proliferating endothelial cells of vascular sprouts and branching vessels of embryonic tissues, but not 
so abundant in adult tissues. Millauer etal., Nature, 367:576-78 (1993). These data have suggested that VEGFR-2 is 

55 a major regulator of vasculogenesis and angiogenesis. VEGF-C may thus have a unique effect on lymphatic endothe- 
lium and a more redundant function, shared with VEGF, in angiogenesis and possibly in regulating the permeability of 
several types of endothelia. Because VEGF-C stimulates VEGFR-2 and promotes endothelial migration, VEGF-C may 
be useful as an inducer of angiogenesis of blood and lymphatic vessels in wound healing, in tissue transplantation, in 
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eye diseases, and in the formation of collateral vessels around arterial stenoses and into injured tissues after infarction. 
[0049] Taken together, these results show an increased complexity of signalling in the vascular endothelium. They 
reinforce the concept that when organs differentiate and begin to perform their specific functions, the phenotypic het- 
erogeneity of endothelial cells increases in several types of functionally and morphologically distinct vessels. However, 
5 upon stimulation by suitable angiogenic stimuli, endothelial cells can re-enter the cell cycle, migrate, withdraw from 
the cell cycle and subsequently differentiate again to form new vessels that are functionally adapted to their tissue 
environment. This process of angiogenesis, concurrent with tissue development and regeneration, depends on the 
tightly controlled balance between positive and negative signals for endothelial cell proliferation, migration, differenti- 
ation and survival. 

w [0050] Previously-identified growth factors promoting angiogenesis include the fibroblast growth factors, hepatocyte 
growth factor/scatter factor, PDGF and TGF-oc. (See e.g., Folkman, Nature Med., 7:27-31 (1995); Friesel etaL, FASEB 
J., 9:919-25 (1995); Mustonen etaL, J. Cell. Biol., 729:895-98 (1995). However, VEGF has been the only growth factor 
relatively specific for endothelial cells. The newly identified factors VEGF-B [Olofsson etal., Proc. Natl. Acad. ScL, 93: 
2578-81 (1 996)] and VEGF-C thus increase our understanding of the complexity of the specific and redundant positive 

15 signals for endothelial cells involved in vasculogenesis, angiogenesis, permeability, and perhaps also other endothelial 
functions. Expression studies using Northern blotting show abundant VEGF-C expression in heart and skeletal muscle; 
other tissues, such as placenta, ovary, small intestine, thyroid gland, kidney, prostate, spleen, testis and large intestine 
also express this gene. Whereas PIGF is predominantly expressed in the placenta, the expression patterns of VEGF, 
VEGF-B and VEGF-C overlap in many tissues, which suggests that members of the VEGF family may form heterodim- 

20 ers and interact to exert their physiological functions. 

[0051] Targeted mutagenesis leading to inactivation of the VEGF receptor loci in the mouse genome has shown that 
VEGFR-1 is necessary for the proper organization of endothelial cells forming the vascular endothelium, while VEGFR- 
2 is necessary for the generation of both endothelial and hematopoietic cells. This suggests that the four genes of the 
VEGF family can be targets for mutations leading to vascular malformations or cardiovascular diseases. 

25 [0052] The following Examples illustrate preferred embodiments of the invention, wherein the isolation, characteri- 
zation, and function of Flt4 ligands and ligand-encoding nucleic acids according to the invention are shown. 

EXAMPLE 1 

30 Production of pLTRFIt4l expression vector 

[0053] Construction of the LTR-Flt41 vector, encoding the long form of Flt4 receptor tyrosine kinase, is schematically 
shown in Fig. 2. The full-length Flt4s (Flt4 short form) cDNA (Genbank Accession No. X68203, SEQ ID NO: 36) was 
assembled by first subcloning the S2.5 fragment, reported in Pajusola etal, Cancer Res., 52:5738-5743 (1992), in- 
35 corporated by reference herein, containing base pairs 56-2534 of the Flt4s into the EcoRI site of the pSP73 vector 
(Promega, Madison, Wl). 

[0054] Since cDN A libraries used for screening of Flt4 cDNAs did not contain the extreme 5' protein-coding sequenc- 
es, inverse PCR was used for the amplification of the 5' end of Flt4 corresponding to the first 12 amino acid residues 
(MQRGAALCLRLW). Poly(A) + RNA was isolated from human HEL erythroleukemia cells and double-stranded cDNA, 

40 were synthesized using an Amersham cDNA Synthesis System Plus kit (Amersham Corp., Buckinghamshire, U.K.) 
and a gene-specific primer: 5'-TGTCCTCGCTGTCCTTGTCT-3' (SEQ ID NO: 1), which was located 195 bp down- 
stream of the 5' end of clone S2.5. Double-stranded cDNA was treated with T4 DNA polymerase to blunt the ends and 
cDNA was purified by filtration with Centricon 1 00 filters (Amicon Inc., Beverly, MA). Circularization of the blunt-ended 
cDNA was performed by ligation in a total volume of 1 50 microliters. The reaction mixture contained a standard ligation 

45 buffer, 5% PEG-8000, 1 mM DTT and 8 U of T4 DNA ligase (New England Biolabs, Beverly, MA). Ligation was carried 
out at 16°C for 16 hours. Fifteen microliters of this reaction mix were used in a standard PCR reaction (100 jnl total 
volume) containing 100 ng of Flt4-specific primers introducing Sad and Pst\ restriction sites, and 1 unit of Taq DNA 
polymerase (Perkin Elmer Cetus). Two rounds of PCR were performed using 33 cycles per round (denaturation at 
95°C for 1 minute, annealing at 55°C for 2 minutes, and elongation at 72°C for 4 minutes). The PCR mixture was 

50 treated sequentially with the Sacl and Pstl restriction enzymes, and after purification with MagicPCR Preps (Promega), 
DNA fragments were subcloned into the pGEM3Zf(+) vector for sequencing (Promega). The sequence corresponded 
to the 5' end of the Flt4s cDNA clone deposited in the Genbank Database as Accession No. X68203. 
[0055] The sequence encoding the first 12 amino acid residues was added to the expression construct by ligating 
an Sphl-digested PCR fragment amplified using reverse transcription-PCR of poly(A) + RNA isolated from the HEL 

55 cells. The forward primer had the following sequence: 5'-ACAT GCATGC CACCATGCAG CGGGGCGCCG CGCTGT- 
GCCT GCGACTGTGG CTCTGCCTGG GACTCCTGGA-3' (SEQ ID NO: 2) (Sph\ site underlined, translational start 
codon marked in bold). The translation start codon is immediately downstream from an optimized Kozak consensus 
sequence. (Kozak, Nucl. Acids Res., 15: 8125-8148 (1987).) The reverse primer, 5'-ACAT GCATGC CCCGCCGGT 
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CATCC-3' (SEQ ID NO: 3) (Sph\ site underlined), to the 5' end of the S2.5 fragment, thus replacing the unique Sph\ 
fragment of the S2.5 plasmid. The resulting vector was digested with EcoRI and C/al and ligated to a 138 bp PCR 
fragment amplified from the 0.6 kb EcoRI fragment (base pairs 3789 to 441 6 in the Genbank X68203 sequence) which 
encodes the 3' end of Flt4s shown in Figure 1 of Pajusola et al., Cancer Res., 52:5738-5743 (1992), using the oligo- 

5 nucleotides S'-CG GAATTC CC CATGACCCCA AC-3' (SEQ ID NO: 4) (forward primer, EcoRI site underlined) and 
5'-CC ATCGAT GG ATCCTACCTG AAGCCGCTTT CTT-3' (SEQ ID NO: 5) (reverse primer, C/al site underlined). The 
coding domain was completed by ligation of the 1 .2 kb EcoRI fragment (base pairs 2535-3789 of the sequence found 
in Gen Bank Acc. No. X68203) into the above construct. The complete cDNA was subcloned as a H/nc/lll-C/al(blunted) 
fragment (this C/al site was also included in the 3' primer used to construct the 3' end of the coding sequence) to the 

10 pLTRpoly expression vector reported in Makelaef a/., Gene, 1 18: 293-294 (1992) (Genbank accession number X60280, 
SEQ ID NO: 37), incorporated by reference herein, using its /-//ndlll-Accl(blunted) restriction sites. 
[0056] The long form of Flt4 (Flt41 ) was produced by replacing the 3'-end of the short form as follows: The 3' region 
of the Flt41 cDNA was PCR-amplified using a gene-specific oligonucleotide (SEQ ID NO: 7, see below) and a pGEM 
3Z vector-specific (SP6 promoter) oligonucleotide 5'-ATTTAGGTGACACTATA-3' (SEQ ID NO: 6) as reverse and for- 

15 ward primers, respectively. The template for PCR was an Flt41 cDNA clone containing a 495 bp EcoRI fragment ex- 
tending downstream of the EcoRI site at nucleotide 3789 of the Genbank X68203 sequence (the sequence downstream 
of this EcoRI site is deposited as the Flt4 long form 3' sequence having Genbank accession number S66407 (SEQ ID 
NO: 38)). The gene-specific oligonucleotide contains a SamHI restriction site located right after the end of the coding 
region and has the following sequence: 5'-CCATCGA TGGATCC CGATGCTGCTTAGTAGCTGT-3'(SEQ ID NO: 7) 

20 (BamHI site is underlined). The PCR product was digested with EcoRI and SamHI and transferred in frame to the 
LTRFIt4s vector fragment from which the coding sequences downstream of the EcoRI site at base pair 2535 (see 
sequence X68203) had been removed by EcoRI-SamHI digestion. The resulting clone was designated pLTRFIt41. 
Again, the coding domain was completed by ligation of the 1 .2 kb EcoRI fragment (base pairs 2535-3789 of sequence 
X68203) back into the resulting construct. 

25 

EXAMPLE 2 

Production and analysis of Flt4l transfected cells 

30 [0057] NIH 3T3 cells (60 % confluent) were co-transfected with 5 micrograms of the pLTRFIt41 construct and 0.25 
micrograms of the pSV2neo vector containing the neomycin phosphotransferase gene (Southern et al., J. Mol. Appl. 
Genet, 7:327 (1982)), using the DOTAP liposome-based transfection reagents (Boehringer-Mannheim, Mannheim, 
Germany). One day after transfection, the cells were transferred into selection media containing 0.5 mg/ml geneticin 
(GIBCO, Grand Island, N.Y.). Colonies of geneticin-resistant cells were isolated and analyzed for expression of the 

35 Flt4 proteins. Cells were lysed in boiling lysis buffer containing 3.3% SDS 125 mM Tris, pH 6.8. Protein concentrations 
of the samples were measured by the BCA method (Pierce, Rockford, IL). About 50 micrograms of protein from each 
lysate were analyzed for the presence of Flt4 by 6% SDS-polyacrylamide gel electrophoresis (SDS-PAGE) and immu- 
noblotting using antisera against the carboxyl terminus of Flt4. Signals on Western blots were revealed using the ECL 
method (Amersham). 

40 [0058] For production of anti-Flt4 antiserum, the Flt4 cDNA fragment encoding the 40 carboxy-terminal amino acid 
residues of the short form: NH2-PMTPTTYKG SVDNQTDSGM VLASEEFEQI ESRHRQESGFR-COOH (SEQ ID NO: 
8) was cloned as a 657 bp EcoRI-fragment into the pGEX-IXT bacterial expression vector (Pharmacia-LKB, Inc., Upp- 
sala, Sweden) in frame with the glutathione-S-transferase coding region. The resultant GST-Flt4S fusion protein was 
produced in E. coli and purified by affinity chromatography using a glutathione-Sepharose 4B column. The purified 

45 protein was lyophilized, dissolved in phosphate-buffered saline (PBS), mixed with Freund's adjuvant and used for 
immunization of rabbits at bi-weekly intervals using methods standard in the art (Harlow etal., Antibodies: A Laboratory 
Manual (Cold Spring Harbor Laboratory Press, 1988)). Antisera were used, after the fourth booster immunization, for 
immunoprecipitation of Flt4 from transfected cells. Cell clones expressing Flt4 were also used for ligand stimulation 
analysis. 

50 

EXAMPLE 3 

Construction of a Flt4 EC baculovirus vector and expression and purification of its product 

55 [0059] The construction of an Flt4 extracellular domain (EC) baculovirus vector is schematically depicted in Fig. 3. 
The Flt4-encoding cDNA was prepared in both a long form and a short form, each being incorporated in a vector under 
control of the Moloney murine leukemia virus LTR promoter. The nucleotide sequence of the short form of the Flt4 
receptor is available from the Genbank database as Accession No. X68203 and the specific 3' segment of the long 
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form cDNA is available under GenBank Accession No. S66407. 

[0060] The ends of a cDNA segment encoding the Flt4 extracellular domain (EC) were modified as follows: The 3' 
end of the Flt4 cDNA (Genbank Accession Number X68203) extracellular domain sequence was amplified using primer 
1116 (5'-CTGGA GTCGAC TTGGCGGACT-3'; SEQ ID NO: 9, Sa/I site underlined) and primer 1315 

5 (5'-CGC GGATCC CTAGTGATGGTGATGGTGATGTCTACCTTCGATCATGCTGCCCTTAT CCTC-3'; (SEQ ID NO: 1 0, 
BamHI site underlined). The sequence at the 5' end of primer 1315 is not complementary to the Flt4 coding region. 
Inspection of the sequence that is complementary to this region of primer 1315 reveals in a 5' to 3' order, a stop codon, 
six contiguous histidine codons (for subsequent chromatographic purification of the encoded polypeptide using a 
Ni-NTA column; Qiagen, Hilden, Germany), and an added BamHI, site. The amplified fragment was digested with Sa/I 

10 and BamHI and used to replace a unique Sal\-Bam\-\\ fragment in the LTRFIt4 vector shown in Fig. 3. The Sa/I -BamHI 
fragment that was replaced encodes the Flt4 transmembrane and cytoplasmic domains. The result was a modified 
LTRFIt4 vector. 

[0061] The 5' end without the Flt4 signal sequence encoding region was amplified by PCR using the primer 1335 
(5'-CCC AAGCTTGGATCC AAGTGGCTACTCCATGACC-3'; (SEQ ID NO: 11) the primer contains added H/ndlll 

15 (AAGCTT) and BamHI (GGATCC) restriction sites, which are underlined). The second primer used to amplify the region 
encoding the Flt4 signal sequence was primer 1332. 5'-GTTGCCTGTGATGTGC ACCA-3' ; SEQ ID NO: 12). The am- 
plified fragment was digested with H/ndlll and Sph\ (the Hind\\\ site (AAGCTT) is underlined in primer 1335 and the 
Sph\ site is within the amplified region of the Flt4 cDNA). The resultant Hind\\\-Sph\ fragment was used to replace a 
H/ndlll-Spftl-fragment in the modified LTRFIt4 vector described immediately above (the H/nd 1 1 1 site is in the 5' junction 

20 of the Flt4 insert with the pLTRpoly portion of the vector, the Sph\ site is in the Flt4 cDNA). The resultant Flt4EC insert 
was then ligated as a BamHI fragment into the BamHI site in the pVTBac plasmid described in Tessier etal., Gene 98: 
177-183 (1991), incorporated herein by reference. The relative orientation of the insert was confirmed by partial se- 
quencing so that the open reading frame of the signal sequence-encoding portion of the vector was adjacent to, and 
in frame with, the Flt4 coding region sequence. The Flt4EC construct was transfected together with baculovirus genomic 

25 DNA into SF-9 cells by lipofection. Recombinant virus was purified, amplified and used for infection of High-Five cells 
(Invitrogen, San Diego, CA) using methods standard in the art. The Flt4 extracellular domain (Flt4EC) was purified 
from the culture medium of the infected High-Five cells using Ni-NTA affinity chromatography according to manufac- 
turer's instructions (Qiagen) for binding and elution of the 6xHis tag encoded in the COOK-terminus of the recombinant 
Flt4 extracellular domain. 

30 

EXAMPLE 4 

Isolation of an Flt4 Ligand from Conditioned Media 

35 [0062] A human Flt4 ligand according to the invention was isolated from media conditioned by a PC-3 prostatic 
adenocarcinoma cell line (ATCC CRL 1435) in Ham's F-12 Nutrient mixture (GIBCO) containing 7% fetal calf serum 
(FCS). The cells were grown according to the supplier's instructions. In order to prepare the conditioned media, con- 
fluent PC-3 cells were cultured for 7 days in Ham's F-1 2 Nutrient mixture (GIBCO) in the absence of fetal bovine serum 
(FBS). Medium was then cleared by centrifugation at 10,000 g for 20 minutes. The medium was then screened to 

40 determine its ability to induce tyrosine phosphorylation of Flt4 by exposure to NIH 3T3 cells which had been transfected 
with Flt4-encoding cDNA using the pl_TRFIt4l vector. For receptor stimulation experiments, subconfluent NIH 3T3 cells 
were starved overnight in serum-free DMEM medium (GIBCO) containing 0.2% bovine serum albumin (BSA). The 
cells were stimulated with the conditioned mediators minutes, washed twice with cold PBS containing 100 micromolar 
vanadate, and lysed in RIPA buffer (10 mM Tris pH 7.5, 50 mM NaCI, 0.5% sodium deoxycholate, 0.5% Nonidet P40 

45 (BDH, Poole, England), 0.1% SDS, 0.1 U/ml Aprotinin (Boehringer Mannheim), 1 mM vanadate) for receptor immuno- 
precipitation analysis. The lysates were centrifuged for 20 minutes at 15,000 x g. The supernatants were incubated 
for 2 hours on ice with 3 microliters of the antiserum against the Flt4 C-terminus described in Example 2. See also 
Pajusola et al,, Oncogene, 8:2931-2937 (1993), incorporated by reference herein. 

[0063] After a two hour incubation in the presence of anti-Flt4 antiserum, protein A-Sepharose (Pharmacia) was 
50 added and incubation was continued for 45 minutes with rotation. The immunoprecipitates were washed three times 
with the immunoprecipitation buffer and twice with 10 mM Tris, pH 7.5, before analysis by SDS-PAGE. Polypeptides 
were transferred to nitrocellulose and analyzed by Western blotting using Flt4- or phosphotyrosine-specific antisera 
and the ECL method (Amersham Corp.). Anti-phosphotyrosine monoclonal antibodies (anti-PTyr; PY20) were pur- 
chased from Transduction Laboratories (Lexington, Kentucky). In some cases, the filters were restained with a second 
55 antibody after stripping. The stripping of the filters was done for 30 minutes at 50°C in 100 mM 2-mercaptoethanol, 
2% SDS, 62.5 mM Tris-HCI, pH 6.7, with occasional agitation. 

[0064] As shown in Fig. 4, the PC-3 conditioned medium (PC-3CM), stimulated tyrosine phosphorylation of a 125 
kD polypeptide when Flt4- expressing NIH 3T3 cells were treated with the indicated preparations of media, lysed, and 
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the lysates were immunoprecipitated with anti-Flt4 antiserum followed by SDS-PAGE, Western blotting, and staining 
using anti-PTyr antibodies. The resulting band was weakly phosphorylated upon stimulation with unconcentrated PC- 
3 conditioned medium (lane 2). The 125 kD band comigrated with the tyrosine phosphorylated, processed form of the 
mature Flt4 from pervanadate-treated cells (compare lanes 2 and 7 of Fig. 4, see also Fig. 5A). Comigration was 

5 confirmed upon restaining with anti-Flt4 antibodies as is also shown in Fig. 5A (panel on the right). In order to show 
that the 125 kD polypeptide is not a non-specific component of the conditioned medium reactive with anti-phosphoty- 
rosine antibodies, 15 microliters of conditioned medium were separated by SDS-PAGE, blotted on nitrocellulose, and 
the blot was stained with anti-PTyr antibodies. No signal was obtained (Fig. 5B). Also, unconditioned medium failed to 
stimulate Flt4 phosphorylation, as shown in Fig. 4, lane 1. 

w [0065] Fig. 5C shows a comparison of the effects of PC-3 CM stimulation (+) on untransfected (lanes 4 and 5), FGFR- 
4-transfected (lanes 8 and 9) and Flt4-transfected NIH 3T3 cells (lanes 1 -3, 6 and 7). These results indicate that neither 
untransfected NIH 3T3 cells nor NIH 3T3 cells transfected with FGFR-4 showed tyrosine phosphorylation of a protein 
of about 125 kD upon stimulation with the conditioned medium from PC-3 cells. Analysis of stimulation by PC-3 CM 
pretreated with Heparin-Sepharose CL-6B (Pharmacia) for 2 hours at room temperature (lane 3) showed that the Flt4 

15 ligand does not bind to heparin. 

[0066] As shown in Fig. 4, lane 3, stimulating activity was considerably increased when the PC-3 conditioned medium 
was concentrated four-fold using a Centricon-1 0 concentrator (Amicon). Fig. 4, lane 4, shows that pretreatment of the 
concentrated PC-3 conditioned medium with 50 microliters of the Flt4 extracellular domain coupled to CNBr-activated 
sepharose CL-4B (Pharmacia; about 1mg of Flt4EC domain/ml sepharose resin) completely abolished Flt4 tyrosine 

20 phosphorylation. Similar pretreatment of the conditioned medium with unsubstituted sepharose CL-4B did not affect 
stimulatory activity, as shown in Fig. 4, lane 5. Also, the flow through obtained after concentration, which contained 
proteins of less than 10,000 molecular weight, did not stimulate Flt4 phosphorylation, as shown in Fig. 4, lane 6. 
[0067] In another experiment, a comparison of Flt4 autophosphorylation in transformed NIH 3T3 cells expressing 
LTRFIt4l was conducted, using unconditioned medium, medium from PC-3 cells expressing the Flt4 ligand, or uncon- 

25 ditioned medium containing either 50 ng/ml of VEGF1 65 or 50 ng/ml of PIGF-1 . The cells were lysed, immunoprecip- 
itated using anti-Flt4 antiserum and analyzed by Western blotting using anti-phosphotyrosine antibodies, only the PC- 
3 conditioned medium expressing the Flt4 ligand (lane Flt-4L) stimulated Flt4 autophosphorylation. 
[0068] The foregoing data show that PC-3 cells produce a ligand which binds to the extracellular domain of Flt4 and 
activates this receptor. 

30 

EXAMPLE 5 

Purification of the Flt4 Ligand 

35 [0069] The ligand expressed by human PC-3 cells as characterized in Example 4 was purified and isolated using a 
recombinantly-produced Flt4 extracellular domain (Flt4EC) in affinity chromatography. 

[0070] Two harvests of serum-free conditioned medium, comprising a total of 8 liters, were collected from 500 con- 
fluent 15 cm diameter culture dishes containing confluent layers of PC-3 cells. The conditioned medium was clarified 
by centrifugation at 1 0,000 x g and concentrated 80-fold using an Ultrasette Tangential Flow Device (Filtron, Northbor- 
40 ough, MA) with a 10 kD cutoff Omega Ultrafiltration membrane according to the manufacturer's instructions. Recom- 
binant Flt4 extracellular domain was expressed in a recombinant baculovirus cell system and purified by affinity chro- 
matography on Ni-agarose (Ni-NTA affinity column obtained from Qiagen). The purified extracellular domain was cou- 
pled to CNBr-activated Sepharose CL-4B at a concentration of 5 mg/ml and used as an affinity matrix for ligand affinity 
chromatography. 

45 [0071] Concentrated conditioned medium was incubated with 2 ml of the recombinant Flt4 extracellular domain- 
Sepharose affinity matrix in a rolling tube at room temperature for 3 hours. All subsequent purification steps were at 
+4 °C. The affinity matrix was then transferred to a column with an inner diameter of 1 5 mm and washed successively 
with 100 ml of PBS and 50 ml of 10 mM Na-phosphate buffer (pH 6.8). Bound material was eluted step-wise with 100 
mM glycine-HCI, successive 6 ml elutions having pHs of 4.0, 2.4, and 1.9. Several 2 ml fractions of the eluate were 

50 collected in tubes containing 0.5 ml 1 M Na-phosphate (pH 8.0). Fractions were mixed immediately and dialyzed in 1 
mM Tris-HCI (pH 7.5). Aliquots of 75 jnl each were analyzed for their ability to stimulate tyrosine phosphorylation of 
Flt4. The ultrafiltrate, 100 jllI aliquots of the concentrated conditioned medium before and after ligand affinity chroma- 
tography, as well as 15-fold concentrated fractions of material released from the Flt4 extracellular domain-Sepharose 
matrix during the washings were also analyzed for their ability to stimulate Flt4 tyrosine phosphorylation. 

55 [0072] As shown in Fig. 6, lane 3, the concentrated conditioned medium induced prominent tyrosine phosphorylation 
of Flt4 in transfected NIH 3T3 cells overexpressing Flt4. This activity was not observed in conditioned medium taken 
after medium was exposed to the Flt4 Sepharose affinity matrix described above (Fig. 6, lane 4). The specifically- 
bound Flt4-stimulating material was retained on the affinity matrix after washing in PBS, 10 mM Na-phosphate buffer 
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(pH 6.8), and at pH 4.0 (Fig. 6, lanes 5-7, respectively), and it was eluted in the first two 2 ml aliquots at pH 2.4 (lanes 
8 and 9). A further decrease of the pH of the elution buffer did not cause release of additional Flt4-stimulating material 
(Fig. 6, lane 1 1 ). Fig. 6, lane 1 depicts a control wherein Flt4-expressing cells were treated with unconditioned medium; 
lane 2 depicts the results following treatment of Flt4-expressing cells with the ultrafiltrate fraction of conditioned medium 

5 containing polypeptides of less than 1 0 kD molecular weight. 

[0073] Small aliquots of the chromatographic fractions were concentrated in a SpeedVac concentrator (Savant, 
Farmingdale, N.Y.) and subjected to SDS-PAGE under reducing conditions with subsequent silver staining of the gel, 
a standard technique in the art. As shown in Fig. 7, the major polypeptide, having a molecular weight of approximately 
23 kD (reducing conditions), was detected in the fractions containing Flt4 stimulating activity (corresponding to lanes 

10 8 and 9 in Fig. 6). That polypeptide was not found in the other chromatographic fractions. On the other hand, besides 
these bands and a very faint band having a 32 kD mobility, all other components detected in the two active fractions 
were also distributed in the starting material and in small amounts in the other washing and eluting steps after their 
concentration. Similar results were obtained in three independent affinity purifications, indicating that the 23 kD polypep- 
tide specifically binds to Flt4 and induces tyrosine phosphorylation of Flt4. 

15 [0074] Fractions containing the 23 kD polypeptide were combined, dried in a SpeedVac concentrator and subjected 
to SDS-PAGE in a 12.5% gel. The proteins from the gel were then electroblotted to Immobilon-P (PVDF) transfer 
membrane (Millipore, Marlborough, MA) and visualized by staining of the blot with Coomassie Blue R-250. The region 
containing only the stained 23 kD band was cut from the blot and subjected to N-terminal amino acid sequence analysis 
in a Prosite Protein Sequencing System (Applied Biosystems, Foster City, CA). The data were analyzed using a 61 OA 

20 Data Analysis System (Applied Biosystems). Analysis revealed a single N-terminal sequence of NH 2 -XEETIK- 
FAAAHYNTEILK-COOH (SEQ ID NO: 13). 

EXAMPLE 6 

25 Construction of PC-3 cell cDNA library in a eukaryotic expression vector 

[0075] Human poly(A) + RNA was isolated from five 15 cm diameter dishes of confluent PC-3 cells by a single step 
method using oligo(dT) (Type III, Collaborative Biomedical Products, Becton-Dickinson Labware, Bedford, MA) cellu- 
lose affinity chromatography (Sambrook et al., 1989). The yield was 70 micrograms. Six micrograms of the Poly(A) + 
30 RNA were used to prepare an oligo(dT)-primed cDNA library in the mammalian expression vector pcDNA I and the 
Librarian kit of Invitrogen according to the instructions included in the kit. The library was estimated to contain about 
10 6 independent recombinants with an average insert size of approximately 1.8 kb. 

EXAMPLE 7 

35 

Amplification of a unique nucleotide sequence encoding the Flt4 ligand amino terminus 

[0076] Degenerate oligonucleotides were designed based on the N-terminal amino acid sequence of the isolated 
human Flt4 ligand and were used as primers in a polymerase chain reaction (PCR) to amplify cDNA encoding the Flt4 
40 ligand from the PC-3 cDNA library. The overall strategy described in Examples 7 and 8 is schematically depicted in 
Fig. 9, where the different primers have been marked with arrows. 

[0077] The PCR was carried out using 1 microgram of DNA from the amplified PC-3 cDNA library and a mixture of 
48 sense-strand primers present in equal proportions, the primer sequences collectively comprising the sequence 
5'-GCAGARGARACNATHAA-3' (SEQ ID NO: 14) (wherein R is A or G, N is A,G,C or T and H is A, C or T), encoding 

45 amino acid residues 2-6 (EETIK, SEQ ID NO: 15) and 384 antisense-strand primers present in equal proportions, the 
anti-sense strand primers collectively comprising the sequence 5'-GCAYTTNARDATYTCNGT-3' (SEQ ID NO: 16) 
(wherein Y is C or T and D is A, G or T), corresponding to amino acid residues 14-18 (TEILK, SEQ ID NO: 17). Three 
extra nucleotides (GCA) were added to the 5'-terminus of each primer to increase annealing stability. Two successive 
PCR runs were carried out using 1 U per reaction of DynaZyme (F-500L, Finnzymes, Espoo, Finland), a thermostable 

50 DNA polymerase, in a buffer supplied by the manufacturer (10 mM Tris-HCI, pH 8.8 at 25°C, 1.5 mM MgCI 2 , 50 mM 
KCI, 0.1% Triton-X100) , at an extension temperature of 72°C. The first PCR run was carried out for 43 cycles. The 
first three cycles were run at an annealing temperature of 33°C for 2 minutes, and the remaining cycles were run at 
42°C for 1 minute. 

[0078] The region of the gel containing a weak band of the expected size (57 bp) was cut out from the gel and eluted. 
55 The eluted material was reamplified for 30 cycles using the same primer pairs described above at 42°C for 1 minute. 
The amplified fragment was cloned into a pCR II vector (Invitrogen) using the TA cloning kit (Invitrogen) and sequenced 
using the radioactive dideoxynucleotide sequencing method of Sanger. Six clones were analyzed and all six clones 
contained the sequence encoding the expected peptide (amino acid residues 104-120 of the Flt4 ligand precursor). 
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Nucleotide sequence spanning the region from the third nucleotide of codon 6 to the third nucleotide of codon 13 (the 
extension region) was identical in all six clones: 5'-ATTCGCTGCAGCACACTACAAC-3' (SEQ ID NO: 18) and thus 
represented an amplified product from the unique sequence encoding part of the amino terminus of the Flt4 ligand. 

5 EXAMPLE 8 

Amplification of the 5'-end of the cDNA encoding the Flt4 ligand 

[0079] Based on the unique nucleotide sequence encoding the N-terminus of the isolated human Flt4 ligand, two 

10 pairs of nested primers were designed to amplify, in two nested PCR reactions, the complete 5'-end of the corresponding 
cDNAs from one microgram of DNA of the above-described PC-3 cDNA library. First, amplification was performed with 
an equal mixture of 4 primers collectively defining the sequence 5'-TCNGTGTTGTAGTGTGCTG-3' (SEQ ID NO: 1 9), 
which is the antisense-strand primer corresponding to amino acid residues 9-15 (AAHYNTE, SEQ ID NO: 20), and 
sense-strand primer 5'-TAATACGACTCACTATAGGG-3' (SEQ ID NO: 21 ), corresponding to the T7 RNA promoter of 

15 the pcDNAI vector used for construction of the library. "Touchdown" PCR was used as disclosed in Don et al., Nucl. 
Acids Res., 79:4008 (1991), incorporated by reference herein. The annealing temperature of the two first cycles was 
62°C and subsequently the annealing temperature was decreased in every other cycle by 1 °C until a final temperature 
of 53°C was reached, at which temperature 16 additional cycles were conducted. Annealing time was 1 minute and 
extension at each cycle was conducted at 72°C for 1 minute. Multiple amplified DNA fragments were obtained in the 

20 first reaction. The products of the first amplification (1 jul of a 1 :1 00 dilution in water) were used in the second amplifi- 
cation reaction employing a pair of nested primers comprising an antisense-strand primer 5'-GTTGTAGTGTGCT- 
GCAGCGAATTT-3'; SEQ ID NO: 22) encoding amino acid residues 6-13 (KFAAAHYN, SEQ ID NO: 23) of the Flt4 
ligand, and a sense-strand primer (5'-TC ACTATAGGG AGACCCAAGC-3' ; SEQ ID NO: 24), corresponding to nucle- 
otides 21 79-21 99 of the pcDNAI vector. The sequences of these sense and antisense primers overlapped with the 3' 

25 ends of the corresponding primers used in the first PCR. "Touchdown" PCR was carried out by decreasing the annealing 
temperature from 72°C to 66°C and continuing with 18 additional cycles at 66°C. The annealing time was 1 minute 
and extension at each cycle was carried out at 72°C for 2 minutes. One major product of about 220 bp and three minor 
products of about 270 bp, 1 50 bp, and 1 00 bp were obtained. 

[0080] The amplified fragment of approximately 220 bp was excised from an agarose gel, cloned into a pCRII vector 
30 using the TA cloning kit (Invitrogen), and sequenced. Three recombinant clones were analyzed and they contained the 
sequence 5'-TCACTATAGGGAGACCCAAGCTTGGTACCGAGCTCGGATCCACTAGTAACGGCCGCC 
AGTGTGGTGGAATTCGAC GAACTCATGACTGTACTCTACCCAGAATATTGGAAAATG 
TACAAGTGTCAGCTAAGGCAAGGAGGCTGGCAACATAACAGAGAACAGGCCAACCTC 

AACTCAAGGACAGAAGAGACTATAAAATTCGCTGCAGCACACTACAAC - 3' (SEQ ID NO: 25). The beginning of the 
35 sequence represents the pcDNAI vector and the underlined sequence represents the amplified product of the 5'-end 
of the cDNA insert. 

EXAMPLE 9 

40 Amplification of the 3'-end of cDNA encoding the Flt4 ligand 

[0081] Based upon the amplified 5'-sequence of the clones encoding the amino terminus of the 23 kD human Flt4 
ligand, two pairs of non-overlapping nested primers were designed to amplify the 3'-portion of the Flt-4-ligand-encoding 
cDNA clones. The sense-strand primer 5'-ACAGAGAACAGGCCAACC-3' (SEQ ID NO: 26), corresponding to nucle- 
us otides 152-1 69 of the amplified 5'-sequences of the Flt4 ligand (SEQ ID NO: 25), and antisense-strand primer 5'-TCTAG- 
CATTTAGGTGACAC-3' (SEQ ID NO: 27) corresponding to nucleotides 2311-2329 of the pcDNAI vector were used in 
a first "touchdown" PCR. The annealing temperature of the reaction was decreased 1°C every two cycles from 72°C 
to 52°C, at which temperature 15 additional cycles were carried out. The annealing time was 1 minute and extension 
at each cycle was carried out at 72°C for 3 minutes. DNA fragments of several sizes were obtained in the first ampli- 
50 fication. Those products were diluted 1 :200 in water and reamplified in PCR using the second pair of primers: 5'-AA- 
GAGACTATAAAATTCGCTGCAGC-3' (SEQ ID NO: 28) and 5'-CCCTCTAGATGCATGCTCGA-3' (SEQ ID NO: 29) (an- 
tisense-strand primer corresponding to nucleotides 2279-2298 of the pcDNAI vector). Two DNA fragments were ob- 
tained, having sizes of 1350 bp and 570 bp. Those fragments were cloned into a pCRII vector and the inserts of the 
clones were sequenced. Both of these fragments were found to contain sequences encoding an amino acid sequence 
55 homologous to the VEGF sequence. 
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EXAMPLE 10 

Screening the PC-3 cell cDHA library using the 5' PCR fragment of Flt4 ligand cDNA 

5 [0082] A 219 bp 5'-terminal fragment of human Flt4 ligand cDNA was amplified by PCR using the 5' PCR fragment 
described above and primers 5'-GTTGTAGTGTGCTGCAGCGAATTT-3' (antisense-strand primer, SEQ I D NO: 30) and 
5'-TCACTATAGGGAGACCCAAGC-3' (SEQ I D NO: 31 ) (sense-primer corresponding to nucleotides 21 79-21 99 of the 
pcDNAI vector). The amplified product was subjected to digestion with EcoRI (Boehringer Mannheim) to remove the 
portion of the DNA sequence amplified from the pcDNAI vector and the resulting 153 bp fragment encoding the 5' end 

10 of the Flt4 ligand was labeled with [ 32 P]-dCTP using the Klenow fragment of E. coli DNA polymerase I (Boehringer 
Mannheim). That fragment was used as a probe for hybridization screening of the amplified PC-3 cell cDNA library. 
[0083] Filter replicas of the library were hybridized with the radioactively labeled probe at 42°C for 20 hours in a 
solution containing 50% formamide, 5x SSPE, 5x Denhardt's solution, 0.1% SDS and 0.1 mg/ml denatured salmon 
sperm DNA. Filters were washed twice in 1x SSC, 0.1% SDS for 30 minutes at room temperature, then twice for 30 

15 minutes at 65°C and exposed overnight. 

[0084] On the basis of autoradiography, 10 positive recombinant bacterial colonies hybridizing with the probe were 
chosen from the library. Plasmid DNA was purified from these colonies and analyzed by EcoRI and Not\ digestion and 
agarose gel electrophoresis followed by ethidium bromide staining. The ten plasmid clones were divided into three 
groups on the basis of the presence of insert sizes of approximately 1 .7, 1 .9 and 2. 1 kb, respectively. Inserts of plasmids 

20 from each group were sequenced using the T7 oligonucleotide as a primer and walking primers for subsequent se- 
quencing reactions. 

[0085] Sequence analysis showed that all clones contain the open reading frame encoding the NH2-terminal se- 
quence of the 23 kD human Flt4 ligand. Dideoxy sequencing was continued using walking primers in the downstream 
direction. A complete human cDNA sequence and deduced amino acid sequence from a 2 kb clone is set forth in SEQ 
25 id NOs: 32 and 33, respectively. A putative cleavage site of a "prepro" leader sequence is located between residues 

102 and 103 of SEQ ID NO: 33. When compared with sequences in the GenBank Database, the predicted protein 
product of this reading frame was found to be homologous with the predicted amino acid sequences of the PDGF/ 
VEGF family of growth factors, as shown in Fig. 10. 

[0086] Plasmid pFLT4-L, containing the 2.1 kb human cDNA clone in pcDNAI vector, has been deposited with the 
30 American Type Culture Collection, 12301 Parklawn Drive, Rockville, MD 20852 as accession number 97231 . 

EXAMPLE 11 

Stimulation of Flt4 autophosphorylation by the protein product of the Flt4 ligand vector 

35 

[0087] The 2.1 kb human cDNA insert of plasmid pFlt4-L, which contains the open reading frame encoding the 
sequence shown in SEQ ID NOs: 32 and 33; human VEGF-C, see below), was cut out from the pcDNAI vector using 
Hind\\\ and Not\ restriction enzymes, isolated from a preparative agarose gel, and ligated to the corresponding sites 
in the pREP7 expression vector (Invitrogen). The pREP7 vector containing the pFlt4-L insert was transfected into 

40 293-EBNA cells (Invitrogen) using the calcium phosphate transfection method (Sambrooke? a/., 1989). About 48 hours 
after transfection the medium of the transfected cells was changed to DMEM medium lacking fetal calf serum and 
incubated for 36 h. The conditioned medium was then collected, centrifuged at 5000 x g for 20 minutes, the supernatant 
was concentrated 5-fold using Centriprep 10 (Amicon) and used to stimulate NIH 3T3 cells expressing LTRFIt4l (the 
Flt4 receptor), as in Example 4. The cells were lysed, immunoprecipitated using anti-Flt4 antiserum and analyzed by 

45 Western blotting using anti-phosphotyrosine antibodies. 

[0088] The conditioned medium from two different dishes of the transfected cells stimulated Flt4 autophosphorylation 
in comparison with the medium from mock-transfected cells, which gave only background levels of phosphorylation of 
the Flt4 receptor. When the concentrated conditioned medium was pre-absorbed with 20 microliters of a slurry of Flt4EC 
domain coupled to Sepharose (see example 4), no phosphorylation was obtained, showing that the activity responsible 

50 for Flt4 autophosphorylation was indeed the Flt4 ligand. Thus, these results demonstrate that an expression vector 
having an approximately 2.1 kb insert and containing an open reading frame as shown in SEQ ID NO: 32 is expressed 
as a biologically active Flt4 ligand (VEGF-C) in transfected cells. The sequence encoded by that open reading frame 
is shown in SEQ ID NO: 33. 

[0089] The deduced molecular weight of a polypeptide consisting of the complete amino acid sequence in SEQ ID 
55 NO: 33 (residues 1 to 41 9) is 46,883. The deduced molecular weight of a polypeptide consisting of amino acid residues 

103 to 419 of SEQ ID NO: 33 is 35,881. The Flt4 ligand purified from PC-3 cultures had an observed molecular weight 
of about 23 kD as assessed by SDS-PAGE under reducing conditions. Thus, it appears that the Flt4 ligand mRNA is 
translated into a precursor polypeptide, from which the mature ligand is derived by proteolytic cleavage. Also, the Flt4 
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ligand may be glycosylated at three putative N-linked glycosylation sites conforming to the consensus which can be 
identified in the deduced Flt4 ligand amino acid sequence (N-residues underlined in Fig. 10). 

[0090] The carboxyl terminal amino acid sequences, which increase the predicted molecular weight of the Flt4 ligand 
subunit in comparison with other ligands of this family, show a pattern of spacing of cysteine residues reminiscent of 
5 the Balbiani ring 3 protein (BR3P) sequence (Dignam et al., Gene, 88:133-140 (1990)), as depicted schematically in 
Fig. 9. Such a sequence may encode an independently folded domain present in a Flt4 ligand precursor and it may 
be involved, for example, in the regulation of secretion, solubility, stability, cell surface localization or activity of the Flt4 
ligand. Interestingly, at least one cysteine motif of the BR3P type is also found in the VEGF carboxy terminal amino 
acid sequences. 

w [0091] Thus, the Flt4 ligand mRNA appears first to be translated into a precursor from the mRNA corresponding to 
the cDNA insert of plasmid FLT4-L, from which the mature ligand is derived by proteolytic cleavage. To define the 
mature Flt4 ligand polypeptide, one first expresses the cDNA clone (which is deposited in the pcDNAI expression 
vector) in cells, such as COS cells. One uses antibodies generated against encoded polypeptides, fragments thereof, 
or bacterial Flt4 fusion proteins, such as a GST-fusion protein, to raise antibodies against the VEGF-homologous 

15 domain and the amino- and carboxyl-terminal propeptides of Flt4 ligand. One then follows the biosynthesis and process- 
ing of the Flt4 ligand in the transfected cells by pulse-chase analysis using radioactive cysteine for labelling of the cells, 
immunoprecipitation and gel electrophoresis. Using antibodies against the three domains of the product encoded by 
the cDNA insert of plasmid FLT4-L, material for radioactive or nonradioactive amino-terminal sequence analysis is 
isolated. The determination of the amino-terminal sequence of the mature VEGF-C polypeptide allows for identification 

20 of the amino-terminal proteolytic processing site. The determination of the amino-terminal sequence of the carboxyl- 
terminal propeptide will give the carboxyl-terminal processing site. This is confirmed by site-directed mutagenesis of 
the amino acid residues adjacent to the cleavage sites, which would prevent the cleavage. 

[0092] The Flt4 ligand is further characterizeable by progressive 3' deletions in the 3' coding sequences of the Flt4 
ligand precursor clone, introducing a stop codon resulting in carboxy-terminal truncations of its protein product. The 

25 activities of such truncated forms are assayed by, for example, studying Flt4 autophosphorylation induced by the trun- 
cated proteins when applied to cultures of cells, such as NIH 3T3 cells expressing LTRFIt4. By extrapolation from 
studies of the structure of the related platelet derived growth factor (PDGF, Heldin et al., Growth Factors, 8:245-252 
(1993)) one determines that the region critical for receptor activation by the Flt4 ligand is contained within the first 
approximately 180 amino acid residues of the secreted VEGF-C protein lacking the putative 102 amino acid prepro 

30 leader (SEQ ID NO: 33, residues 103-282), and apparently within the first approximately 120 amino acid residues (SEQ 
ID NO: 33, residues 103-223). 

[0093] On the other hand, the difference between the molecular weights observed for the purified ligand and deduced 
from the open reading frame of the Flt4 ligand clone may be due to the fact that the soluble ligand was produced from 
an alternatively spliced mRNA which would also be present in the PC-3 cells, from which the isolated ligand was 

35 derived. To isolate such alternative cDNA clones one uses cDNA fragments of the deposited clone and PCR primers 
made according to the sequence provided as well as techniques standard in the art to isolate or amplify alternative 
cDNAs from the PC-3 cell cDNA library. One may also amplify using reverse transcription (RT)-PCR directly from the 
PC-3 mRNA using the primers provided in the sequence of the cDNA insert of plasmid FLT4-L. Alternative cDNA 
sequences are determined from the resulting cDNA clones. One can also isolate genomic clones corresponding to the 

40 Flt4 ligand mRNA transcript from a human genomic DNA library using methods standard in the art and to sequence 
such clones or their subcloned fragments to reveal the corresponding exons. Alternative exons can then be identified 
by a number of methods standard in the art, such as heteroduplex analysis of cDNA and genomic DNA, which are 
subsequently characterized. 

45 EXAMPLE 12 

Expression of the Gene Encoding VEGF-C in Human Tumor Cell Lines 

[0094] Expression of transcripts corresponding to the Flt4 ligand (VEGF-C) was analyzed by hybridization of Northern 
50 blots containing isolated poly(A) + RN A from HT-1 080 and PC-3 human tumor cell lines. The probe was the radioactively 
labelled insert of the 2.1 kb cDNA clone (pFlt4-L/VEGF-C, specific activity 10 8 -10 9 cpm/mg of DNA). The blot was 
hybridized overnight at 42°C using 50% formamide, 5x SSPE buffer, 2% SDS, 10 x Denhardt's solution, 100 mg/ml 
salmon sperm DNA and 1 x 10 6 cpm of the labelled probe/ml. The blot was washed at room temperature for 2 x 30 
minutes in 2x SSC containing 0.05% SDS, and then for 2 x 20 minutes at 52°C in 0.1x SSC containing 0.1% SDS. The 
55 blot was then exposed at -70°C for three days using intensifying screens and Kodak XAR film. Both cell lines expressed 
an Flt4 ligand mRNA of about 2.4 kb, as well as VEGF and VEGF-B mRNAs (Fig. 12). 
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EXAMPLE 13 

VEGF-C Chains Are Proteolytically Processed after Biosynthesis and Disulfide Linked 

5 [0095] The predicted molecular mass of a secreted human VEGF-C polypeptide, as deduced from the VEGF-C open 
reading frame, is 46,883 kD, suggesting that VEGF-C mRNA may be first translated into a precursor, from which the 
ligands of 21/23 kD and 29/32 kD are derived by proteolytic cleavage. 

[0096] This possibility was explored by metabolic labelling of 293 EBNA cells expressing VEGF-C. Initially, 293 EBNA 
cells were transfected with the VEGF-C construct. Expression products were labeled by the addition of 100 jnCi/ml of 

10 Pro-mix™ L-[ 35 S] in vitro cell labelling mix ((containing 35 S-methionine and 35 S-cysteine) Amersham, Buckingham- 
shire, England) to the culture medium devoid of cysteine and methionine. After two hours, the cell layers were washed 
twice with PBS and the medium was then replaced with DM EM-0.2% BSA. After 1 , 3, 6, 1 2 and 24 hours of subsequent 
incubation, the culture medium was collected, clarified by centrifugation, and concentrated, and human VEGF-C was 
bound to 30 jllI of a slurry of Flt4EC-Sepharose overnight at +4°C, followed by three washes in PBS, two washes in 20 

15 mM Tris-HCI (pH 7.5), alkylation, SDS-PAGE and autoradiography. Alkylation was carried out by treatment of the sam- 
ples with 10mM 1,4 Dithiothreitol (Boehringer-Mannheim, Mannheim, Germany) forone hour at25°C, and subsequently 
with 30 mM iodoacetamide (Fluka, Buchs, Switzerland). 

[0097] These experiments demonstrated that a putative precursor polypeptide of 32 kD apparent molecular mass 
was bound to the Flt4EC affinity matrix from the conditioned medium of metabolically labelled cells transfected with 

20 the human VEGF-C expression vector (Fig. 1 3A) , but not from mock (M) transfected cells. Increased amounts of a 23 
kD receptor binding polypeptide accumulated in the culture medium of VEGF-C transfected cells during a subsequent 
chase period of three hours, but not thereafter (lanes 2-4 and data not shown), suggesting that the 23 kD form is 
produced by proteolytic processing, which is incomplete, at least in the transiently transfected cells. The arrows in Fig. 
13A indicate the 32 kD and 23 kD polypeptides of secreted VEGF-C. Subsequent experiments showed that the 32kD 

25 VEGF-C form contains two components migrating in the absence of alkylation as polypeptides of 29 and 32 kD (Figs. 
21-23). 

[0098] In a related experiment, human VEGF-C isolated using Flt4EC-Sepharose after a 4 h continuous metabolic 
labelling was analyzed by polyacrylamide gel electrophoresis in nonreducing conditions (Fig. 13B). Higher molecular 
mass forms were observed under nonreducing conditions, suggesting that the VEGF-C polypeptides can form disulfide- 
30 linked dimers and/or multimers (arrows in Fig. 13B). 

EXAMPLE 14 

Stimulation Of VEGFR-2 Autophosphorylation By VEGF-C 

35 

[0099] Conditioned medium (CM) from 293 EBNA cells transfected with the human VEGF-C vector also was used 
to stimulate porcine aortic endothelial (PAE) cells expressing VEGFR-2 (Kdr). Pajusola et al., Oncogene, 9:3545-55 
(1994); Waltenberger et al., J. Biol. Chem., 269:26988-26995 (1994). The cells were lysed and immunoprecipitated 
using VEGFR-2 - specific antiserum (Waltenberger et al., 1994). 

40 [0100] PAE-KDR cells (Waltenberger et al., 1994) were grown in Ham's F12 medium-10% fetal calf serum (FCS). 
Confluent NIH 3T3-Flt4 cells or PAE-KDR cells were starved overnight in DM EM or Ham's F12 medium, respectively, 
supplemented with 0.2% bovine serum albumin (BSA), and then incubated for 5 minutes with the analyzed media. 
Recombinant human VEGF (R&D Systems) and PDGF-BB, functional as stimulating agents, were used as controls. 
The cells were washed twice with ice-cold Tris-Buffered Saline (TBS) containing 100 mM sodium orthovanadate and 

45 lysed in RIPA buffer containing 1 mM phenylmethylsulfonyl fluoride (PMSF), 0.1 U/ml aprotinin and 1 mM sodium 
orthovanadate. The lysates were sonicated, clarified by centrifugation at 16,000 x g for 20 minutes and incubated for 
3-6 hours on ice with 3-5 |iil of antisera specific for Flt4 (Pajusola etal., 1 993), VEGFR-2 or PDGFR-p (Claesson-Welsh 
et al., J. Biol. Chem., 264:1742-1747 (1989); Waltenberger et al., 1994). Immunoprecipitates were bound to protein 
A-Sepharose, washed three times with RIPA buffer containing 1mM PMSF, 1 mM sodium orthovanadate, washed twice 

50 with 1 0 mM Tris-HCI (pH 7.4), and subjected to SDS-PAGE using a 7% gel. Polypeptides were transferred to nitrocel- 
lulose by Western blotting and analyzed using PY20 phosphotyrosine-specific monoclonal antibodies (Transduction 
Laboratories) or receptor-specific antiserum and the ECL detection method (Amersham Corp.). 
[0101] The results of the experiment are presented in Figs. 14A and 14B. As shown in Fig. 14A, PAE cells expressing 
VEGFR-2 were stimulated with 10- or 2-fold concentrated medium from mock-transfected 293-EBNA cells (lanes 1 

55 and 2), or with 2-, 5- or 1 0-fold concentrated medium from 293-EBNA cell cultures expressing the recombinant VEGF-C 
(lanes 3-6). VEGFR-2 was immunoprecipitated with specific antibodies and analyzed by SDS-PAGE and Western 
blotting using phosphotyrosine antibodies. For comparison, the stimulations were carried out with non-conditioned 
medium containing 50 ng/ml of purified recombinant VEGF (lanes 7 and 8). Lanes 6 and 7 show stimulation with 
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VEGF-C- or VEGF- containing media pretreated with Flt4EC. As depicted in Fig. 14B, PDGFR-p-expressing NIH 3T3 
cells were stimulated with non-conditioned medium (lane 1), 5-fold concentrated CM from mock-transfected (lane 2) 
or VEGF-C - transfected (lanes 3 and 4) cells, or with non-conditioned medium containing 50 ng/ml of recombinant 
human PDGF-BB (lane 5). Medium containing VEGF-C was also pretreated with recombinant Flt4EC (lane 4). PDGFR- 

5 p was immunoprecipitated with specific antibodies and analyzed by SDS-PAGE and Western blotting using phospho- 
tyrosine antibodies with subsequent stripping and reprobing of the membrane with antibodies specific for PDGFR-p. 
[0102] Referring again to Fig. 14A, a basal level of tyrosine phosphorylation of VEGFR-2 was detected in cells stim- 
ulated by CM from the mock-transfected cells. A further concentration of this medium resulted in only a slight enhance- 
ment of VEGFR-2 phosphorylation (lanes 1 and 2). CM containing recombinant VEGF-C stimulated tyrosine autophos- 

10 phorylation of VEGFR-2 and the intensity of the autophosphorylated polypeptide band was increased upon concen- 
tration of the VEGF-C CM (lanes 3-5). Furthermore, the stimulating effect was abolished after pretreatment of the 
medium with the Flt4EC affinity matrix (compare lanes 1 , 5 and 6). The maximal effect of VEGF-C in this assay was 
comparable to the effect of recombinant VEGF added to unconditioned medium at concentration of 50 ng/ml (lane 8). 
Pretreatment of the medium containing VEGF with Flt4EC did not abolish its stimulating effect on VEGFR-2 (compare 

15 lanes 7 and 8). These results suggest that the VEGF-C expression vector encodes a ligand not only for Flt4 (VEGFR- 
3), but also for VEGFR-2 (Kdr). 

[0103] In order to further confirm that the stimulating effect of VEGF-C on tyrosine phosphorylation of VEGFR-3 and 
VEGFR-2 was receptor-specific, we analyzed the effect of VEGF-C on tyrosine phosphorylation of PDGF receptor p 
(PDGFR-P) which is abundantly expressed on fibroblastic cells. As can be seen from Fig. 14B, a weak tyrosine phos- 
20 phorylation of PDGFR-p was detected upon stimulation of Flt4-expressing NIH 3T3 cells with CM from the mock- 
transfected cells (compare lanes 1 and 2). A similar low level of PDGFR-p phosphorylation was observed when the 
cells were incubated with CM from the VEGF-C transfected cells, with or without prior treatment with Flt4EC (lanes 3 
and 4). In contrast, the addition of 50 ng/ml of PDGF-BB induced a prominent tyrosine autophosphorylation of PDGFR- 
p (lane 5). 

25 

EXAMPLE 15 

VEGF-C Stimulates Endothelial Cell Migration In Collagen Gel 

30 [0104] Conditioned media (CM) from cell cultures transfected with the VEGF-C expression vector was placed in a 
well made in collagen gel and used to stimulate the migration of bovine capillary endothelial (BCE) cells in the three- 
dimensional collagen gel as follows. 

[0105] BCE cells (Folkman et al., Proc. Natl. Acad, Sci. (USA), 76:5217-5221 (1979) were cultured as described in 
Pertovaara etal., J. Biol. Chem., 269:6271-74 (1994). The collagen gels were prepared by mixing type I collagen stock 

35 solution (5 mg/ml in 1 mM HCI) with an equal volume of 2x MEM and 2 volumes of MEM containing 1 0% newborn calf 
serum to give a final collagen concentration of 1 .25 mg/ml. The tissue culture plates (5 cm diameter) were coated with 
about 1 mm thick layer of the solution, which was allowed to polymerize at 37°C. BCE cells were seeded on top of this 
layer. For the migration assays, the cells were allowed to attach inside a plastic ring (1 cm diameter) placed on top of 
the first collagen layer. After 30 minutes, the ring was removed and unattached cells were rinsed away. A second layer 

40 of collagen and a layer of growth medium (5% newborn calf serum (NCS)), solidified by 0.75% low melting point agar 
(FMC BioProducts, Rockland, ME), were added. A well (3 mm diameter) was punched through all the layers on both 
sides of the cell spot at a distance of 4 mm, and the sample or control media were pipetted daily into the wells. Phot- 
omicrographs of the cells migrating out from the spot edge were taken after six days through an Olympus CK 2 inverted 
microscope equipped with phase-contrast optics. The migrating cells were counted after nuclear staining with the 

45 fluorescent dye bisbenzimide (1 mg/ml, Hoechst 33258, Sigma). 

[0106] Fig. 15 depicts a comparison of the number of cells migrating at different distances from the original area of 
attachment towards wells containing media conditioned by the non-transfected (control) or transfected (mock; VEGF-C; 
VEGF) cells, 6 days after addition of the media. The number of cells migrating out from the original ring of attachment 
was counted in five adjacent 0.5 mm x 0.5 mm squares using a microscope ocular lens grid and 1 0x magnification with 

50 a fluorescence microscope. Cells migrating further than 0.5 mm were counted in a similar way by moving the grid in 
0.5 mm steps. The experiments were carried out twice with similar results, and medium values from the one of the 
experiments are presented with standard error bars. As can be seen from the columns, VEGF-C-containing CM stim- 
ulated cell migration more than medium conditioned by the non-transfected or mock-transfected cells but less than 
medium from cells transfected with a VEGF expression vector. Daily addition of 1 ng of FGF2 into the wells resulted 

55 in the migration of approximately twice the number of cells when compared to the stimulation by CM from VEGF- 
transfected cells. 
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EXAMPLE 16 

VEGF-C Is Expressed In Multiple Tissues 

[0107] Northern blots containing 2 micrograms of isolated poly(A) + RNA from multiple human tissues (blot from Clon- 
tech Laboratories, Inc., Palo Alto, CA) were probed with radioactively labelled insert of the 2.1 kb VEGF-C cDNA clone. 
Northern blotting and hybridization analysis showed that the 2.4 kb RNA and smaller amounts of a 2.0 kb mRNA are 
expressed in multiple human tissues, most prominently in the heart, placenta, muscle, ovary and small intestine (Fig. 
1 6A). Very little VEGF-C RNA was seen in the brain, liver or thymus and peripheral blood leukocytes (PBL) appeared 
negative. A similar analysis of RNA from human fetal tissues (Fig. 1 6B) shows that VEGF-C is highly expressed in the 
kidney and lung and to a lesser degree in the liver, while essentially no expression is detected in the brain. Interestingly, 
VEGF expression correlates with VEGF-C expression in these tissues, whereas VEGF-B is highly expressed in all 
tissues analyzed. 

EXAMPLE 17 

The VEGF-C Gene Localizes To Chromosome 4q34 

[0108] A DNA panel of 24 interspecies somatic cell hybrids, which had retained one or two human chromosomes, 
was used for the chromosomal localization of the VEGF-C gene (Bios Laboratories, Inc., New Haven, CT). Primers 
were designed to amplify an about 250 bp fragment of the VEGF-C gene from somatic cell hybrid DNA. The primers 
and conditions for polymerase chain reaction (PCR) were 5'-TGAGTGATTTGTAGCTGCTGTG-3' (forward) [SEQ ID 
NO:34] and 5'-TATTGCAGCAACCCCCACATCT-3' (reverse) [SEQ ID NO: 35] for VEGF-C (94°C, 60s/62°C, 45s/72°C, 
60s). The PCR products were evaluated by electrophoresis in 1% agarose gels and visualized by ethidium bromide 
staining in ultraviolet light. [oc- 32 P]-dCTP- labelled cDNA inserts of a plasmid representing the complete VEGF-C coding 
domain was used as a probe in Southern blotting and hybridization analysis of the somatic cell hybrid DNAs as in- 
structed by the supplier (Bios Laboratories). 

[0109] The cell lines for fluorescence in situ hybridization (FISH) were obtained from the American Type Culture 
Collection (Rockville, MD). Purified DNA from P1 clones 7660 and 7661 (VEGF-C) (Genome Systems, Inc., St. Louis, 
NO) were confirmed positive by Southern blotting of EcoRI- digested DNA followed by hybridization with the VEGF-C 
cDNA. The P1 clones were then labelled by nick translation either with biotin-1 1 -dUTP, biotin-1 4-ATP (Sigma Chemical 
Co., St. Louis, MO)ordigoxigenin 11-dUTP (Boehringer Mannheim GmbH, Mannheim, Germany) according to standard 
protocols. PHA-stimulated peripheral blood lymphocyte cultures were treated with 5-bromodeoxyuridine (BrdU) at an 
early replicating phase to induce G-banding. See Takahashi et ai., Human Genet., 86:14-16 (1995); Lemieux et ai., 
Cytogenet. Cell Genet., 59:311-12 (1992). The FISH procedure was carried out in 50% formamide, 10% dextran sul- 
phate in 2x SSC using well-known procedures. See e.g., Rytkonnen et ai, Cytogenet Cell Genet., 68:61-63 (1995); 
Lichter et ai, Proc. Natl. Acad. Sci. (USA), 85:9664-68 (1988). Repetitive sequences were suppressed with 50-fold 
excess of Cot-1 DNA (BRL, Gaithersburg, MD) compared with the labeled probe. Specific hybridization signals were 
detected by incubating the hybridized slides in labelled antidigoxigenin antibodies, followed by counterstaining with 
0.1mmol/L 4,6-diamino-2-phenylindole. Probe detection for two-color experiments was accomplished by incubating 
the slides in fluorescein isothiocyanate (FITC)-anti-digoxigenin antibodies (Sigma Chemical Co.) and Texas red-avidin 
(Vector Laboratories, Burlingame, CA) or rhodamine-anti-digoxigenin and FITC-avidin. 

[0110] Multi-color digital image analysis was used for acquisition, display and quantification of hybridization signals 
of metaphase chromosomes. The system contains a PXL camera (Photometries Inc., Tucson, AZ) attached to a Pow- 
erMac 7100/Av workstation. IPLab software controls the camera operation, image acquisition and Ludl Filter wheel. 
At least 50 nuclei were scored. Overlapping nuclei and clusters of cells were ignored. A slide containing normal lym- 
phocyte metaphase spreads and interphase nuclei was included in each experiment to control for the efficiency and 
specificity of the hybridization. 

[0111] In order to determine the chromosomal localization of the human VEGF-C gene, DNAs from human rodent 
somatic cell hybrids containing defined sets of human chromosomes were analyzed by Southern blotting and hybrid- 
ization with the VEGF-C cDNA probe. Among 24 DNA samples on the hybrid panel, representing different human 
chromosomes, human-specific signals were observed only in hybrids which contained human chromosome 4. The 
results were confirmed by PCR of somatic cell hybrid DNAs using VEGF-C specific primers, where amplified bands 
were obtained only from DNAs containing human chromosome 4. 

[0112] A genomic P1 plasmid for VEGF-C was isolated using specific primers and PCR and verified by Southern 
blotting and hybridization using a VEGF-C specific cDNA probe. The chromosomal localization of VEGF-C was further 
studied using metaphase FISH. Using the P1 probe for VEGF-C in FISH a specific hybridization to the 4q34 chromo- 
somal band was detected in 40 out of 44 metaphases. Double-fluorochrome hybridization using a cosmid probe specific 
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for the aspartylglucosaminidase (AGA) gene showed that VEGF-C is located just proximal to the AGA gene previously 
mapped to the 4q34-35 chromosomal band. 

[0113] Biotin labelled VEGF-C P1 and digoxigenin labeled AGA cosmid probes were hybridized simultaneously to 
metaphase chromosomes. This experiment demonstrated that the AGA gene is more telomerically located than the 
5 VEGF-C gene. The foregoing example demonstrates the utility of polynucleotides of the invention as chromosomal 
markers and for the presence or absence of the VEGF-C gene region in normal or diseased cells. The VEGF-C locus 
at 4q34 is a candidate target for mutations leading to vascular malformations or cardiovascular diseases. 

EXAMPLE 18 

10 

Effect of glucose concentration and hypoxia on VEGF, VEGF-B and VEGF-C mRNA levels in C6 glioblastoma 
cells 

[01 1 4] Confluent cultures of C6 cells (ATCC CCL 1 07) were grown on 1 0 cm diameter tissue culture plates containing 
15 2.5 ml of DMEM and 5% fetal calf serum plus antibiotics. The cultures were exposed for 16 hours to normoxia in a 
normal cell culture incubator containing 5% C0 2 or hypoxia by closing the culture plates in an airtight glass chamber 
and burning a piece of wood inside until the flame was extinguished due to lack of oxygen. Polyadenylated RNA was 
isolated (as in the other examples) , and 8 micrograms of the RNA was electrophoresed and blot-hybridized with a 
mixture of the VEGF, VEGF-B and VEGF-C probes (see Fig. 1 2). The results show that hypoxia strongly induces VEGF 
20 mRNA expression, both in low and high glucose, but has no significant effect on the VEGF-B mRNA levels. The VEGF-C 
mRNA isolated from hypoxic cells runs slightly faster in gel electrophoresis and an extra band of faster mobility can 
be seen below the upper mRNA band. This observation suggests that hypoxia affects VEGF-C RNA processing. One 
explanation for this observation is that VEGF-C mRNA splicing is altered, affecting the VEGF-C open reading frame 
and resulting in an alternative VEGF-C protein being produced by hypoxic cells. Such alternative forms of VEGF-C 
25 and VEGF-C-encoding polynucleotides are contemplated as an aspect of the invention. This data indicates screening 
and diagnostic utilities for polynucleotides and polypeptides of the invention, such as methods whereby a biological 
sample is screened for the hypoxia-induced form of VEGF-C and/or VEGF-C mRNA. The data further suggests a 
therapeutic indication for antibodies and/or other inhibitors of the hypoxia-induced form of VEGF-C or the normal form 
of VEGF-C. 

30 

EXAMPLE 19 

Pulse-chase labeling and immunoprecipitation of VEGF-C polypeptides from 293 EBNA cells transfected with 
VEGF-C expression vector. 

35 

[01 1 5] The following VEGF-C branched amino-terminal peptide, designated PAM1 26, was synthesized for production 
of anti-VEGF-C antiserum: 

40 NH 2 -E-E-T-I-K-F-A-A-A-H-Y-N-T-E-I-L-K-COOH (SEQ ID 

NO: 39) • 

In particular, PAM126 was synthesized as a branched polylysine structure K3PA4 having four peptide acid (PA) chains 
45 attached to two available lysine (K) residues. The synthesis was performed on a 433A Peptide Synthesizer (Applied 
Biosystems) using Fmoc-chemistry and TentaGel S MAP RAM10 resin mix (RAPP Polymere GmbH, Tubingen, Ger- 
many), yielding both cleavable and resin-bound peptides. The cleavable peptide was purified via reverse phase HPLC 
and was used together with the resin-bound peptide in immunizations. The correctness of the synthesis products were 
confirmed using mass-spectroscopy (Lasermatt). 
50 [0116] The PAM126 peptide was dissolved in phosphate buffered saline (PBS), mixed with Freund's adjuvant, and 
used for immunization of rabbits at bi-weekly intervals using methods standard in the art (Harlow and Lane, Antibodies, 
a laboratory manual, Cold Spring Harbor Laboratory Press (1988)). Antisera obtained after the fourth booster immu- 
nization was used for immunoprecipitation of VEGF-C in pulse-chase experiments, as described below. 
[0117] For pulse-chase analysis, 293 EBNA cells transfected with a VEGF-C expression vector (i.e., the FLT4-L 
55 cDNA inserted into the pREP7 expression vector as described above) were incubated for 30 minutes in methionine- 
free, cysteine-free, serum-free DMEM culture medium at 37°C. The medium was then changed, and 200 juCi of Pro- 
mix™ (Amersham), was added. The cell layers were incubated in this labeling medium for two hours, washed with 
PBS, and incubated for 0, 15, 30, 60, 90, 120, or 180 minutes in serum-free DMEM (chase). After the various chase 
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periods, the medium was collected, the cells were again washed two times in PBS, and lysed in immunoprecipitation 
buffer. The VEGF-C polypeptides were analyzed from both the culture medium and from the cell lysates by immuno- 
precipitation, using the VEGF-C-specific antiserum raised against the NH 2 -terminal peptide (PAM126) of the 23 kD 
VEGF-C form. Immunoprecipitated polypeptides were analyzed via SDS-PAGE followed by autoradiography. 
[0118] Referring to Fig. 19, the resultant autoradiograms demonstrate that immediately after a 2 hour labeling (chase 
time 0), the VEGF-C vector-transfected cells contained a radioactive 55 kD polypeptide band, which is not seen in 
mock-transfected cells (M). This 55 kD polypeptide band gradually diminishes in intensity with increasing chase periods, 
and is no longer detected in the cells by 1 80 minutes of chase. A 32 kD polypeptide band also is observed in VEGF-C 
transfected cells (and not mock-transfected cells). This 32 kD band disappears with similar kinetics to that of the 55 
kD band. Simultaneously, increasing amounts of 32 kD (arrow) and subsequently 23 kD (arrow) and 1 4 kD polypeptides 
appear in the medium. 

[0119] Collectively, the data from the pulse-chase experiments indicate that the 55 kD intracellular polypeptide rep- 
resents a pro-VEGF-C polypeptide, which is not secreted from cells, but rather is first proteolytically cleaved into the 
32 kD form. The 32 kD form is secreted and simultaneously further processed by proteolysis into the 23 kD and 14 kD 
forms. Without intending to be limited to a particular theory, it is believed that processing of the VEGF-C precursor 
occurs as removal of a signal sequence, removal of the COOH-terminal domain (BR3P), and removal of an amino 
terminal polypeptide, resulting in a VEGF-C polypeptide having the TEE... amino terminus. 

[0120] At high resolution, the 23 kD polypeptide band appears as a closely-spaced polypeptide doublet, suggesting 
heterogeneity in cleavage or glycosylation. 

EXAMPLE 20 

Isolation of Mouse and Quail cDNA Clones Encoding VEGF-C 

[0121] To clone a mouse variant of VEGF-C, approximately 1 x 10 6 bacteriophage lambda clones of a commercially- 
available 12 day mouse embryonal cDNA library (lambda EXIox library, Novagen, catalog number 69632-1) were 
screened with a radiolabeled fragment of human VEGF-C cDNA containing nucleotides 495 to 1661 of SEQ ID NO: 
32. One positive clone was isolated. 

[0122] A 1323 bp EcoRI/H/ndlll fragment of the insert of the isolated mouse cDNA clone was subcloned into the 
corresponding sites of the pBluescript SK+ vector (Stratagene) and sequenced. The cDNA sequence of this clone was 
homologous to the human VEGF-C sequence reported herein, except that about 710 bp of 5'-end sequence present 
in the human clone was not present in the mouse clone. 

[0123] For further screening of mouse cDNA libraries, a Bind\\\-BstX\ (Hind\\\ site is from the pBluescript SK+ polylink- 
er) fragment of 881 bp from the coding region of the mouse cDNA clone was radiolabeled and used as a probe to 
screen two additional mouse cDNA libraries. Two additional cDNA clones from an adult mouse heart ZAP II cDNA 
library (Stratagene, catalog number 936306) were identified. Three additional clones also were isolated from a mouse 
heart 5'-stretch-plus cDNA library in Xgt11 (Clontech Laboratories, Inc., catalog number ML5002b). Of the latter three 
clones, one was found to contain an insert of about 1.9 kb. The insert of this cDNA clone was subcloned into EcoRI 
sites of pBluescript SK+ vector and both strands of this clone were completely sequenced, resulting in the nucleotide 
and deduced amino acid sequences shown in SEQ ID NOs: 40 and 41 . 

[0124] It is contemplated that the polypeptide corresponding to SEQ ID NO: 41 is processed into a mature mouse 
VEGF-C protein, in a manner analogous to the processing of the human VEGF-C prepropeptide. Putative cleavage 
sites for the mouse protein are identified using procedures outlined above for identification of cleavage sites for the 
human VEGF-C polypeptide. 

[0125] The foregoing results demonstrate the utility of polynucleotides of the invention for identifying and isolating 
polynucleotides encoding other non-human mammalian variants of VEGF-C. Such identified and isolated polynucle- 
otides, in turn, can be expressed (using procedures similar to those described in preceding examples) to produce 
recombinant polypeptides corresponding to non-human mammalian variants of VEGF-C. 

[0126] The mouse and human VEGF-C sequences were used to design probes for isolating a quail VEGF-C cDNA 
from a quail cDNA library. A fragment of the human VEGF-C cDNA comprising nucleotides 495-1 670 of SEQ ID NO: 
32 was obtained by PCR amplification, cloned into the pCRII vector (Invitrogen) according to the manufacturer's in- 
structions, and amplified. The insert was isolated by Eco Rl digestion and preparative gel electrophoresis and then 
labelled using radioactive dCTP and random priming. A cDNA library made from quail embryos of stage E-4 in pcDNA- 
1 vector (Invitrogen) was then screened using this probe. About 200,000 colonies were plated and filter replicas were 
hybridized with the radioactive probe. Nine positive clones were identified and secondarily plated. Two of the nine 
clones hybridized in secondary screening. The purified clones (clones 1 and 14) had approximately 2.7 kb Eco Rl 
inserts. Both clones were amplified and then sequenced using the T7 and SP6 primers (annealing to the vector). In 
addition, an internal Sph I restriction endonuclease cleavage site was identified about 1.9 kb from the T7 primer side 
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of the vector and used for subcloning 5'- and 3'- Sph I fragments, followed by sequencing from the Sph I end of the 
subclones. The sequences obtained were identical from both clones and showed a, high degree of similarity to the 
human VEGF-C coding region. Subsequently, walking primers were made in both directions and double-stranded se- 
quencing was completed for 1743 base pairs, including the full-length open reading frame. 

5 [0127] The cDNA sequence obtained includes a long open reading frame and 5' untranslated region. The DNA and 
deduced amino acid sequences for the quail cDNA are set forth in SEQ ID NOs: 52 and 53, respectively. As shown in 
Fig. 8, the human, murine, and avian (quail) VEGF-C precursor amino acid sequences share a significant degree of 
conservation. This high degree of homology permits the isolation of VEGF-C encoding sequences from other species, 
especially vertebrate species, and more particularly mammalian and avian species, using polynucleotides of the 

10 present invention as probes and using standard molecular biological techniques such as those described herein. 

EXAMPLE 21 

N-terminal peptide sequence analyses of recombinant VEGF-C 

15 

[0128] Cells (293 EBNA) transfected with VEGF-C cDNA (see Example 13) secrete several forms of recombinant 
VEGF-C (Fig. 21 A, lane IP). In the absence of alkylation, the three major, proteolytically-processed forms of VEGF-C 
migrate in SDS-PAGE as proteins with apparent molecular masses of 32/29 kD (doublet), 21 kD and 1 5 kD. Two minor 
polypeptides exhibit approximate molecular masses of 63 and 52 kD, respectively. One of these polypeptides is pre- 
20 sumably a glycosylated and non-processed form; the other polypeptide is presumably glycosylated and partially proc- 
essed. 

[0129] To determine sites of proteolytic cleavage of the VEGF-C precursor, an immunoaffinity column was used to 
purify VEGF-C polypeptides from the conditioned medium of 293 EBNA cells transfected with VEGF-C cDNA. To pre- 
pare the immunoaffinity column, a rabbit was immunized with a synthetic peptide corresponding to amino acids 1 04-1 20 
25 of SEQ ID NO: 33: H 2 N-EETIKFAAAHYNTEILK (see PAM126 in Example 19). The IgG fraction was isolated from the 
serum of the immunized rabbit using protein A Sepharose (Pharmacia). The isolated IgG fraction was covalently bound 
to CNBr-activated Sepharose CL-4B (Pharmacia) using standard techniques at a concentration of 5 mg IgG/ml of 
Sepharose. This immunoaffinity matrix was used to isolate processed VEGF-C from 1 .2 liters of the conditioned medium 
(CM). 

30 [0130] The purified material eluted from the column was analyzed by gel electrophoresis and Western blotting. Frac- 
tions containing VEGF-C polypeptides were combined, dialyzed against 10 mM Tris HCI, vacuum-dried, electrotrans- 
ferred to Immobilon-P (polyvinylidene difluoride or PVDF) transfer membrane (Millipore, Marlborough, MA) and sub- 
jected to N-terminal amino acid sequence analysis. 

[0131] The polypeptide band of 32 kD yielded two distinct sequences: NH 2 -FESGLDLSDA... and NH 2 -AVVMTQT- 
35 PAS... (SEQ ID NO: 51 ), the former corresponding to the N-terminal part of VEGF-C after cleavage of the signal peptide, 
starting from amino acid 32 (SEQ ID NO: 33), and the latter corresponding to the kappa-chain of IgG, which was present 
in the purified material due to "leakage" of the affinity matrix during the elution procedure. 

[0132] In order to obtain the N-terminal peptide sequence of the 29 kD form of VEGF-C, a construct (VEGF-C NHis) 
encoding a VEGF-C variant was generated. In particular, the construct encoded a VEGF-C variant that fused a 6xHis 
40 tag to the N-terminus of the secreted precursor (i.e., between amino acids 31 and 33 in SEQ ID NO: 33). The pheny- 
lalanine at position 32 was removed to prevent possible cleavage of the tag sequence during secretion of VEGF-C. 
The VEGF-C NHis construct was cloned into pREP7 as a vector; the construction is described more fully in Example 
28, below. 

[0133] The calcium phosphate co-precipitation technique was used to transfect VEGF-C NHis into 293 EBNA cells. 

45 Cells were incubated in DM EM/10% fetal calf serum in 15 cm cell culture dishes (a total of 25 plates). The following 
day, the cells were reseeded into fresh culture dishes (75 plates) containing the same medium and incubated for 48 
hours. Cell layers were then washed once with PBS and DM EM medium lacking FCS was added. Cells were incubated 
in this medium for 48 hours and the medium was collected, cleared by centrifugation at 5000 x g and concentrated 
500X using an Ultrasette Tangential Flow Device (Filtron, Northborough, MA), as described in Example 5 above. 

50 VEGF-C NHis was purified from the concentrated conditioned medium using TALON™ Metal Affinity Resin (Clontech 
Laboratories, Inc.) and the manufacturer's protocol for native protein purification using imidazole-containing buffers. 
The protein was eluted with a solution containing 20 mM Tris-HCI (pH 8.0), 1 00 mM NaCI, and 200 mM imidazole. The 
eluted fractions containing purified VEGF-C NHis were detected by immunoblotting with Antiserum 882 (antiserum 
from rabbit 882, immunized with the PAM-126 polypeptide). Fractions containing VEGF-C NHis were combined, dia- 

55 lyzed and vacuum-dried. As can be seen in Fig. 27, due to the presence of the 6xHis tag at the N-terminus of this form 
of VEGF-C, the upper component of the major doublet of the VEGF-CNHis migrates slightly slower than the 32 kD 
form of wild type VEGF-C, thereby improving the separation of the VEGF-CNHis 32 kD variant from the 29 kD band 
using SDS-PAGE. Approximately 15 jug of the purified VEGF-C were subjected to SDS-PAGE under reducing condi- 
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tions, electrotransferred to Immobilon-P (PVDF) transfer membrane (Millipore, Inc., Marlborough, MA) and the band 
at 29 kD was subjected to N-terminal amino acid sequence analysis. This sequence analysis revealed an N-terminal 
sequence of H 2 N-SLPAT..., corresponding to amino acids 228-232 of VEGF-C (SEQ ID NO: 33). 
[0134] The polypeptide band of 21 kD yielded the sequence H 2 N-AHYNTEILKS corresponding to an amino-ter- 
5 minus starting at amino acid 112 of SEQ ID NO: 33. Thus, the proteolytic processing site which results in the 21 kD 
form of VEGF-C produced by transfected 293 EBNA cells apparently occurs nine amino acid residues downstream of 
the cleavage site which results in the 23 kD form of VEGF-C secreted by PC-3 cells. 

[0135] The N-terminus of the 15 kD form was identical to the N-terminus of the 32 kD form (NH 2 -FESGLDLSDA...). 
The 1 5 kD form was not detected when recombinant VEG F-C was produced by COS cells. This suggests that production 
10 of this form is cell lineage specific. 

Example 22 

Dimeric and monomeric forms of VEGF-C 

15 

[0136] The composition of VEGF-C dimers was analyzed as follows. Cells (293 EBNA cells), transfected with the 
pREP7 VEGF-C vector as described in Example 11, were metabolically labelled with Pro-mix L-[ 35 S] labelling mix 
(Amersham Corp.) to a final concentration of 1 00 juCi/ml. 

[0137] In parallel, a VEGF-C mutant, designated "R102S", was prepared and analyzed. To prepare the DNA encoding 
20 VEGF-C-R102S, the arginine codon at position 102 of SEQ ID NO: 33 was replaced with a serine codon. This 
VEGF-C-R1 02S-encoding DNA, in a pREP7 vector, was transfected into 293 EBNA cells and expressed as described 
above. VEGF-C polypeptides were immunoprecipitated using antisera 882 (obtained by immunization of a rabbit with 
a polypeptide corresponding to residues 1 04-1 20 of SEQ I D NO: 33 (see previous Example)) and antisera 905 (obtained 
by immunization of a rabbit with a polypeptide corresponding to a portion of the prepro- VEGF-C leader: H 2 N-ESGLD- 
25 LS D A E P DAG E ATAYAS K (residues 33 to 54 of SEQ ID NO: 33). 

[0138] The immunoprecipitates from each cell culture were subjected to SDS-PAGE under non-denaturing conditions 
(Fig. 21 B). Bands 1 -6 were cut out from the gel, soaked for 30 minutes in 1 x gel-loading buffer containing 200 mM (3- 
mercaptoethanol, and individually subjected to SDS-PAGE under denaturing conditions (Figs. 21 A and 21 C, lanes 1 -6). 
[0139] As can be seen from Figures 21A-C, each high molecular weight form of VEGF-C (Fig. 21 B, bands 1-4) 
30 consists of at least two monomers bound by disulfide bonds (Compare Figs. 21 A and 21 C, lanes 1 -4, in the reducing 
gels). The main component of bands 1-3 is the doublet of 32/29 kD, where both proteins are present in an equimolar 
ratio. The main fraction of the 21 kD form is secreted as either a monomer or as a homodimer connected by means 
other than disulfide bonds (bands 6 and lanes 6 in Figs. 21 A-C). 

[0140] The R102S mutation creates an additional site for N-linkedglycosylation in VEGF-C at the asparagine residue 
35 at position 100 in SEQ ID NO: 33. Glycosylation at this additional glycosylation site increases the apparent molecular 
weight of polypeptides containing the site, as confirmed in Figures 21 A-C and Figures 22A-B. The additional glyco- 
sylation lowers the mobility of forms of VEGF-C-R102S that contain the additional glycosylation site, when compared 
to polypeptides of similar primary structure corresponding to VEGF-C. Figures 21 A-C and Figures 22A-B reveal that 
the VEGF-C-R1 02S polypeptides corresponding to the 32 kD and 1 5 kD forms of wt VEGF-C exhibit increased apparent 
40 molecular weights, indicating that each of these polypeptides contains the newly introduced glycosylation site. In par- 
ticular, the VEGF-C-R102S polypeptide corresponding to the 15 kD polypeptide from VEGF-C comigrates on a gel 
with the 21 kD form of the wild type (wt) VEGF-C, reflecting a shift on the gel to a position corresponding to a greater 
apparent molecular weight. (Compare lanes 4 in Figures 21 A and 21 C). 

[0141] In a related experiment, another VEGF-C mutant, designated "R226,227S," was prepared and analyzed. To 
45 prepare a DNA encoding VEGF-C-R226,2227S, the arginine codons at positions 226 and 227 of SEQ ID NO: 33 were 
replaced with serine codons by site-directed mutagenesis. The resultant DNA was transfected into 293 EBNA cells as 
described above and expressed and analyzed under the same conditions as described for VEGF-C and 
VEGF-C-R1 02S. In the conditioned medium from the cells expressing VEGF-C-R226,227S, no 32 kD form of VEGF-C 
was detected. These results indicate that a C-terminal cleavage site of wild-type VEGF-C is adjacent to residues 226 
50 and 227 of SEQ ID NO: 33, and is destroyed by the mutation of the arginines to serines. Again, the mobility of the 29 
kD component of the doublet was unchanged (Figures 22A-B). 

[0142] Taken together, these data indicate that the major form of the processed VEGF-C is a heterodimer consisting 
of (1 ) a polypeptide of 32 kD containing amino acids 32-227 of the prepro-VEGF-C (amino acids 32 to 227 in SEQ ID 
NO: 33) attached by disulfide bonds to (2) a polypeptide of 29 kD beginning with amino acid 228 in SEQ ID NO: 33. 
55 These data are also supported by a comparison of the pattern of immunoprecipitated, labelled VEGF-C forms using 
antisera 882 and antisera 905. 

[0143] When VEGF-C immunoprecipitation was carried out using conditioned medium, both antisera (882 and 905) 
recognized some or all of the three major processed forms of VEGF-C (32/29 kD, 21 kD and 1 5 kD). When the condi- 
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tioned medium was reduced by incubation in the presence of 10 mM dithiothreitol for two hours at room temperature 
with subsequent alkylation by additional incubation with 25 mM iodoacetamide for 20 minutes at room temperature, 
neither antibody precipitated the 29 kD component, although antibody 882 still recognized polypeptides of 32 kD, 21 
kD and 15 kD. These results are consistent with the nature of the oligopeptide antigen used to elicit the antibodies 

5 contained in antisera 882, an oligopeptide containing amino acid residues 104-120 of SEQ ID NO: 33. On the other 
hand, antisera 905 recognized only the 32 kD and 15 kD polypeptides, which include sequence of the oligopeptide 
(amino acids 33 to 54 of SEQ ID NO: 33) used for immunization to obtain antisera 905. Taking into account the mobility 
shift of the 32 kD and 1 5 kD forms, the immunoprecipitation results with the R1 02S mutant were similar (Figs. 23A-B). 
The specificity of antibody 905 is confirmed by the fact that it did not recognize a VEGF-C AN variant form wherein the 

10 N-terminal propeptide spanning residues 32-102 of the unprocessed polypeptide had been deleted (Fig. 23B). 

[0144] The results of these experiments also demonstrate that the 21 kD polypeptide is found (1) in heterodimers 
with other molecular forms (see Figs. 21 A-C and Figs. 22A-B) , and (2) secreted as a monomer or a homodimer held 
by bonds other than disulfide bonds (Figs. 21 A and 21 B, lanes 6). 

[0145] The experiments disclosed in this example demonstrate that several forms of VEGF-C exist. A variety of 
15 VEGF-C monomers were observed and these monomers can vary depending on the level and pattern of glycosylation. 
In addition, VEGF-C was observed as a multimer, for example a homodimer or a heterodimer. The processing of 
VEGF-C is schematically presented in Fig. 18 (disulfide bonds not shown). 

Example 23 

20 

In situ Hybridization of Mouse Embryos 

[0146] To analyze VEGF-C mRNA distribution in different cells and tissues, sections of 12.5 and 14.5-day post-coitus 
(p.c.) mouse embryos were prepared and analyzed via in situ hybridization using labeled VEGF-C probes. In situ 

25 hybridization of tissue sections was performed as described in Vastrik et al., J. Ceil Biol., 728:1197-1208 (1995). A 
mouse VEGF-C antisense RNA probe was generated from linearized pBluescript II SK+ plasmid (Stratagene Inc., La 
Jolla, CA), containing a cDNA fragment corresponding to nucleotides 499-979 of a mouse VEGF-C cDNA (SEQ ID 
NO: 40). Radiolabeled RNA was synthesized using T7 polymerase and [ 35 S]-UTP (Amersham). Mouse VEGF-B anti- 
sense and sense RNA probes were synthesized in a similar manner from linearized pCRII plasmid containing the 

30 mouse VEGF-B cDNA insert as described Olofsson et al., Proc. Natl. Acad. Sci. (USA) , 93:2576-2581 (1996). The 
high stringency wash was for 45 minutes at 65°C in a solution containing 30 mM dithiothreitol (DTT) and 4 x SSC. The 
slides were exposed for 28 days, developed and stained with hematoxylin. For comparison, similar sections were 
hybridized with a VEGFR-3 probe and the 12.5-day p.c. embryos were also probed for VEGF-B mRNA. 
[0147] Figures 34A-D show darkfield (Figures 34A-C) and lightfield (Figure 34D) photomicrographs of 12.5 day p.c. 

35 embryo sections probed with the antisense (Fig. 34A) and sense (Figs. 34C-D) VEGF-C probes. Fig. 34A illustrates a 
parasagittal section, where VEGF-C mRNA is particularly prominent in the mesenchyme around the vessels surround- 
ing the developing metanephros (mn). In addition, hybridization signals were observed between the developing verte- 
brae (vc), in the developing lung mesenchyme (lu), in the neck region and developing forehead. The specificity of these 
signals is evident from the comparison with VEGF-B expression in an adjacent section (Fig. 34B), where the myocar- 

40 dium gives a very strong signal and lower levels of VEGF-B mRNA are detected in several other tissues. Both genes 
appear to be expressed in between the developing vertebrae (vc), in the developing lung (lu) and forehead. Hybridi- 
zation of the VEGF-C sense probe showed no specific expression within these structures (Fig. 34C). 
[0148] Figs. 35A-D show a comparison of the expression patterns of VEGF-C and VEGFR-3 in 1 2.5 day p.c. mouse 
embryos in the jugular region, where the developing dorsal aorta and cardinal vein are located. This is the area where 

45 the first lymphatic vessels sprout from venous sac-like structures according to the long-standing theory of Sabin, Am. 
J. Anal, 9:43-91 (1909). As can be seen from Figs. 35A-D, an intense VEGF-C signal is detected in the mesenchyme 
surrounding the developing venous sacs (Figs. 35A and 35C) which are positive for VEGFR-3 (Figs. 35B and 35D). 
[0149] The mesenterium supplies the developing gut with blood and contains developing lymphatic vessels. The 
developing 14.5 day p.c. mesenterium is positive for VEGF-C mRNA, with particularly high expression in connective 

50 tissue surrounding certain vessels (arrowheads in Figs. 35E-H). This signal in Fig. 35E should be distinguished from 
the false positive reflection of light from red blood cells within the vessel. The adjacent mesenterial VEGFR-3 signals 
shown in Fig. 35F originate from small capillaries of the mesenterium (arrowhead). Therefore, there appears to be a 
paracrine relationship between the production of the mRNAs for VEGF-C and its receptor. This data indicates that 
VEGF-C is expressed in a variety of tissues. Moreover, the pattern of expression is consistent with a role for VEGF-C 

55 in venous and lymphatic vessel development. Further, the data reveals that VEGF-C is expressed in non-human ani- 
mals. 
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Example 24 

Analysis of VEGF, VEGF-B, and VEGF-C mRNA Expression in Fetal and Adult Tissues 

5 [0150] A human fetal tissue Northern blot containing 2 jug of polyadenylated RNAs from brain, lung, liver and kidney 
(Clontech Inc.) was hybridized with a pool of the following probes: a human full-length VEGF-C cDNA insert (Genbank 
Acc. No. X94216), a human VEGF-B 167 cDNA fragment (nucleotides 1-382, Genbank Acc. No. U48800) obtained by 
PCR amplification; and a human VEGF 581 bp cDNA fragment covering base pairs 57-638 (Genbank Acc. No. X1 5997). 
Blots were washed under stringent conditions, using techniques standard in the art. 

10 [0151] Mouse embryo multiple tissue Northern blot (Clontech Inc.) containing 2 jug of polyadenylated RNAs from 7, 
11,15 and 1 7 day postcoital (p.c.) embryos was hybridized with mouse VEGF-C cDNA fragment (base pairs 499-656). 
A mouse adult tissue Northern blot was hybridized with the probes for human VEGF, VEGF-B 167 , VEGF-C and with a 
VEGFR-3 cDNA fragment (nucleotides 1-595; Genbank Acc. No. X68203). 

[0152] In adult mouse tissues, both 2.4 kb and 2.0 kb mRNA signals were observed with the VEGF-C probe, at an 
15 approximately 4:1 ratio. The most conspicuous signals were obtained from lung and heart RNA, while kidney, liver, 
brain, and skeletal muscle had lower levels, and spleen and testis had barely visible levels. As in the human tissues, 
VEGF mRNA expression in adult mice was most abundant in lung and heart RNA, whereas the other samples showed 
less coordinate regulation with VEGF-C expression. Skeletal muscle and heart tissues gave the highest VEGF-B mRNA 
levels from adult mice, as previously reported Olofsson et al., Proc. Natl. Acad. Sci . (USA), 93:2576-2581 (1996). 
20 Comparison with VEGFR-3 expression showed that the tissues where VEGF-C is expressed also contain mRNA for 
its cognate receptor tyrosine kinase, although in the adult liver VEGFR-3 mRNA was disproportionally abundant. 
[0153] To provide a better insight into the regulation of the VEGF-C mRNA during embryonic development, polya- 
denylated RNA isolated from mouse embryos of various gestational ages (7, 11, 15, and 17 day p.c.) was hybridized 
with the mouse VEGF-C probe. These analyses showed that the amount of 2.4 kb VEGF-C mRNA is relatively constant 
25 throughout the gestational period. 

Example 25 

Regulation of mRNAs for VEGF family members by serum, interleukin-1 and dexamethasone in human 
30 fibroblasts in culture 

[0154] Human IMR-90 fibroblasts were grown in DMEM medium containing 10% FCS and antibiotics. The cells were 
grown to 80% confluence, then starved for 48 hours in 0.5 % FCS in DMEM. Thereafter, the growth medium was 
changed to DMEM containing 5% FCS, with or without 1 0 ng/ml interleukin-1 (IL-1 ) and with or without 1 mM dexam- 

35 ethasone, as indicated in Figs. 24A-B. The culture plates were incubated with these additions for the times indicated, 
and total cellular RNA was isolated using the TRIZOL kit (GIBCO-BRL). About 20 \iq of total RNA from each sample 
was electrophoresed in 1 .5% formaldehyde-agarose gels as described in Sambrook et al., supra (1 989). The gel was 
used for Northern blotting and hybridization with radiolabeled insert DNA from the human VEGF clone (a 581 bp cDNA 
covering bps 57-638, Genbank Acc. No. 15997) and a human VEGF-B 167 cDNA fragment (nucleotides 1-382, Genbank 

40 Acc. No. U48800) (Fig. 25B). Subsequently, the Northern blots were probed with radiolabeled insert from the VEGF-C 
cDNA plasmid (Fig. 24A) . Primers were labelled using a standard technique involving enzymatic extension reactions 
of random primers, as would be understood by one of ordinary skill in the art. The mobilities of the 28S and 18S 
ribosomal RNA bands are indicated, based on UV photography of ethidium bromide stained RNA before the transfer. 
[0155] As can be seen in Figs. 24A-B, very low levels of VEGF-C and VEGF are expressed by the starved IMR-90 

45 cells as well as cells after 1 hour of stimulation. In contrast, abundant VEGF-B mRNA signal is visible under these 
conditions. After a 4 hours of serum stimulation, there is a strong induction of VEGF-C and VEGF mRNAs, which are 
further increased in the IL-1 treated sample. The effect of IL-1 seems to be abolished in the presence of dexamethasone. 
A similar pattern of enhancement is maintained in the 8 hour sample, but a gradual down-regulation of all signals occurs 
for both RNAs in the 24 hour and 48 hour samples. In contrast, VEGF-B mRNA levels remain constant and thus show 

50 remarkable stability throughout the time period. The results are useful in guiding efforts to use VEGF-C and its frag- 
ments, its antagonists, and anti-VEGF-C antibodies in methods for treating a variety of disorders. 

Example 26 

55 Expression and analysis of recombinant murine VEGF-C 

[0156] The mouse VEGF-C cDNA was expressed as a recombinant protein and the secreted protein was analyzed 
for its receptor binding properties. The binding of mouse VEGF-C to the human VEGFR-3 extracellular domain was 
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studied by using media from Bosc23 cells transfected with mouse VEGF-C cDNA in a retroviral expression vector. 
[0157] The 1.8 kb mouse VEGF-C cDNA was cloned as an EcoRI fragment into the retroviral expression vector 
pBabe-puro containing the SV40 early promoter region [Morgenstern et al., Nucl. Acids Res,, 78:3587-3595 (1990)], 
and transfected into the Bosc23 packaging cell line [Pearet et al., Proc. Natl. Acad. Sci. (USA), 90:8392-8396 (1994)] 

5 by the calcium-phosphate precipitation method. For comparison, Bosc23 cells also were transfected with the previ- 
ously- described human VEGF-C construct in the pREP7 expression vector. The transfected cells were cultured for 
48 hours prior to metabolic labelling. Cells were changed into DMEM medium devoid of cysteine and methionine, and, 
after 45 minutes of preincubation and medium change, Pro-mix™ L-[ 35 S] in vitro cell labelling mix (Amersham Corp.), 
in the same medium, was added to a final concentration of about 120 |uCi/ml. After 6 hours of incubation, the culture 

10 medium was collected and clarified by centrifugation. 

[0158] For immunoprecipitation, 1 ml aliquots of the media from metabolically-labelled Bosc23 cells transfected with 
empty vector or mouse or human recombinant VEGF-C, respectively, were incubated overnight on ice with 2 jliI of 
rabbit polyclonal antiserum raised against an N-terminal 1 7 amino acid oligopeptide of mature human VEGF-C (H 2 N-EE- 
TIKFAAAHYNTEILK) (SEQ ID NO: 33, residues 104-120). Thereafter, the samples were incubated with protein A 

15 sepharose for 40 minutes at 4°C with gentle agitation. The sepharose beads were then washed twice with immuno- 
precipitation buffer and four times with 20 mM Tris-HCI, pH 7.4. Samples were boiled in Laemmli buffer and analyzed 
by 12.5% sodium dodecyl sulfate polyacrylamide gel electrophoresis (SDS-PAGE). 

[0159] Immunoprecipitation of VEGF-C from media of transfected and metabolically-labelled cells revealed bands 
of approximately 30-32x1 0 3 M r (a doublet) and 22-23x1 0 3 M r in 12.5% SDS-PAGE. These bands were not detected 

20 in samples from nontransfected or mock-transfected cells as shown in Fig. 32 (i.e., lanes marked "vector"). These 
results show that antibodies raised against human VEGF-C recognize the corresponding mouse ligand. 
[0160] For receptor binding experiments, 1 ml aliquots of media from metabolically-labelled Bosc23 cells were incu- 
bated with VEGFR-3 extracellular domain (see Example 3) , covalently coupled to sepharose, for 4 hours at 4°C with 
gentle mixing. The sepharose beads were washed four times with ice-cold phosphate buffered saline (PBS), and the 

25 samples were analyzed by gel electrophoresis as described in Joukov et al., EMBO J., 75:290-298 (1 996). 

[0161] As can be seen from Fig. 32, similar 30-32 x 10 3 M r doublet and 22-23 x 10 3 M r polypeptide bands were 
obtained in the receptor binding assay as compared to the immunoprecipitation assay. Thus, mouse VEGF-C binds to 
human VEGFR-3. The slightly faster mobility of the mouse VEGF-C polypeptides may be caused by the four amino 
acid residue difference observed in sequence analysis (residues H88-E91, Fig. 31). 

30 [0162] The capacity of mouse recombinant VEGF-C to induce VEGFR-3 autophosphorylation was also investigated. 
For the VEGFR-3 receptor stimulation experiments, subconfluent NIH 3T3-Flt4 cells, Pajusola et al., Oncogene, 9: 
3545-3555 (1994), were starved overnight in serum-free medium containing 0.2% BSA. In general, the cells were 
stimulated with the conditioned medium from VEGF-C vector-transfected cells for 5 minutes, washed three times with 
cold PBS containing 200 jliM vanadate, and lysed in RIPA buffer for immunoprecipitation analysis. The lysates were 

35 centrifuged for 25 minutes at 16000 x g and the resulting supernatants were incubated for 2 hours on ice with the 
specific antisera, followed by immunoprecipitation using protein A-sepharose and analysis in 7% SDS-PAGE. Polypep- 
tides were transferred to nitrocellulose and analyzed by immunoblotting using anti-phosphotyrosine (Transduction Lab- 
oratories) and anti-receptor antibodies, as described by Pajusola etai, Oncogene, 9:3545-3555 (1 994). Filter stripping 
was carried out at 50°C for 30 minutes in 1 00 mM 2-mercaptoethanol, 2% SDS, 62.5 mM Tris-HCI, pH 6.7, with occa- 

40 sional agitation. The results of the experiment are shown in Fig. 33. The results demonstrate that culture medium 
containing mouse VEGF-C stimulates the autophosphorylation of VEGFR-3 to a similar extent as human baculoviral 
VEGF-C or the tyrosyl phosphatase inhibitor pervanadate. 

[0163] VEGFR-2 stimulation was studied in subconfluent porcine aortic endothelial (PAE) cells expressing Kdr (VEG- 
FR-2) (PAE-VEGFR-2) [Waltenberger etai, J. Biol. Chem., 269:26988-26995 (1994)], which were starved overnight 
45 in serum-free medium containing 0.2% BSA. Stimulation was carried out and the lysates prepared as described above. 
For receptor immunoprecipitation, specific antiserum for VEGFR-2 [Waltenberger et al., J. Biol. Chem., 269: 
26988-26995 (1 994)] was used. The immunoprecipitates were analyzed as described for VEGFR-3 in 7% SDS-PAGE 
followed by Western blotting with anti-phosphotyrosine antibodies, stripping of the filter, and re-probing it with anti-VEG- 
FR-2 antibodies (Santa Cruz). 

50 [0164] Mouse VEGF-C appeared to be a potent inducer of VEGFR-3 autophosphorylation, with the 195x10 3 M r 
precursor and proteolytically cleaved 125x10 3 M r tyrosine kinase polypeptides of the receptor (Pajusola et al., Onco- 
gene, 9:3545-3555 (1994)), being phosphorylated. VEGFR-2 stimulation was first tried with unconcentrated medium 
from cells expressing recombinant VEGF-C, but immunoblotting analysis did not reveal any receptor autophosphor- 
ylation. 

55 [0165] To further determine whether mouse recombinant VEGF-C can also induce VEGFR-2 autophosphorylation 
as observed for human VEGF-C, PAE cells expressing VEGFR-2 were stimulated with tenfold concentrated medium 
from cultures transfected with mouse VEGF-C expression vector and autophosphorylation was analyzed. For compar- 
ison, cells treated with tenfold concentrated medium containing human recombinant VEGF-C (Joukov et al., (1996)), 
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unconcentrated medium from human VEGF-C baculovirus infected insect cells, or pervanadate (a tyrosyl phosphatase 
inhibitor) were used. As can be seen from Fig. 33, in response to human baculoviral VEGF-C as well as pervanadate 
treatment, VEGFR-2 was prominently phosphorylated, whereas human and mouse recombinant VEGF-C gave a weak 
and barely detectable enhancement of autophosphorylation, respectively. Media from cell cultures transfected with 
5 empty vector or VEGF-C cloned in the antisense orientation did not induce autophosphorylation of VEGFR-2. Therefore, 
mouse VEGF-C binds to VEGFR-3 and activates this receptor at a much lower concentration than needed for the 
activation of VEGFR-2. Nevertheless, the invention comprehends methods for using the materials of the invention to 
take advantage of the interaction of VEGF-C with VEGFR-2, in addition to the interaction between VEGF-C and VEGFR- 
3. 

10 

Example 27 

VEGF-C E1 04-S21 3 fragment expressed in Pichia yeast stimulates autophosphorylation of Flt4 (VEGFR-3) and 
KDR (VEGFR-2) 

15 

[0166] A truncated form of human VEGF-C cDNA was constructed wherein (1) the sequence encoding residues of 
a putative mature VEGF-C amino terminus H 2 N-E(104) ETIK (SEQ ID NO: 33, residues 104 et seq.) was fused in- 
frame to the yeast PHOI signal sequence (Invitrogen Pichia Expression Kit, Catalog #K1 71 0-01 ), and (2) a stop codon 
was introduced after amino acid 213 (H 2 N- ...RCMS; i.e., after codon 213 of SEQ ID NO: 32). The resultant truncated 

20 cDNA construct was then inserted into the Pichia pastoris expression vector pHIL-S1 (Invitrogen). For the cloning, an 
internal Bgl\ I site in the VEGF-C coding sequence was mutated without change of the encoded polypeptide sequence. 
[0167] This VEGF-C expression vector was then transfected into Pichia cells and positive clones were identified by 
screening for the expression of VEGF-C protein in the culture medium by Western blotting. One positive clone was 
grown in a 50 ml culture, and induced with methanol for various periods of time from 0 to 60 hours. About 10 jllI of 

25 medium was analyzed by gel electrophoresis, followed by Western blotting and detection with anti-VEGF-C antiserum, 
as described above. As can be seen in Figure 25, an approximately 24 kD polypeptide (note the band spreading due 
to glycosylation) accumulates in the culture medium of cells transfected with the recombinant VEGF-C construct, but 
not in the medium of mock-transfected cells or cells transfected with the vector alone. 

[0168] The medium containing the recombinant VEGF-C protein was concentrated by Centricon 30 kD cutoff ultra- 
30 filtration and used to stimulate NIH 3T3 cells expressing Flt4 (VEGFR-3) and porcine aortic endothelial (PAE) cells 
expressing KDR (VEGFR-2). The stimulated cells were lysed and immunoprecipitated using VEGFR-specific antisera 
and the immunoprecipitates were analyzed by Western blotting using anti-phosphotyrosine antibodies, chemilumines- 
cence, and fluorography. As a positive control for maximal autophosphorylation of the VEGFRs, vanadate (V0 4 ) treat- 
ment of the cells for 10 minutes was used. As can be seen from the results shown in Fig. 26, medium from Pichia 
35 cultures secreting the recombinant VEGF-C polypeptide induces autophosphorylation of both Flt41 polypeptides of 
195 kD and 125 kD as well as the KDR polypeptide of about 200 kD. Vanadate, on the other hand, induces heavy 
tyrosyl phosphorylation of the receptor bands in addition to other bands probably coprecipitating with the receptors. 
[0169] These results demonstrate that a VEGF-homologous domain of VEGF-C consisting of amino acid residues 
104E - 21 3S (SEQ ID NO: 33, residues 104-213) can be recombinantly produced in yeast and is capable of stimulating 
40 the autophosphorylation of Flt4 (VEGFR-3) and KDR (VEGFR-2). Recombinant VEGF-C fragments such as the frag- 
ment described herein, which are capable of stimulating Flt4 or KDR autophosphorylation are intended as aspects of 
the invention 

Example 28 

45 

Properties of the differentially processed forms of VEGF-C 

[0170] The following oligonucleotides were used to generate a set of VEGF-C variants: 

50 5'- TCTCTTCTGTGCTTGAGTTGAG -3' (SEQ ID NO: 42), used to generate VEGF-C R1 02S (arginine mutated to 

serine at position 102 (SEQ ID NO: 33)); 

S'-TCTCTTCTGTCCCTGAGTTGAG -3' (SEQ ID NO: 43), used to generate VEGF-C R102G (arginine mutated to 
glycine at position 102 (SEQ ID NO: 33)); 

5'-TGTGCTGCAGCAAATTTTATAGTCTCTTCTGTGGCGGCGGC GGCGGCGGGCGCCTCGCGAGGACC -3' 
55 (SEQ ID NO: 44), used to generate VEGF-C AN (deletion of N-terminal propeptide corresponding to amino acids 

32-102 (SEQ ID NO: 33)); 

5'- CTGGCAGGGAACTGCTAATAATGGAATGAA - 3' (SEQ ID NO: 45), used to generate VEGF-C R226,227S 
(arginine codons mutated to serines at positions 226 and 227 (SEQ ID NO: 33)); 
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5'-GGGCTCCGCGTCCGAGAGGTCGAGTCCGGACTCGTGATGGT GATGGTGATGGGCGGCGGCGGCG- 
GCGGGCGCCTCGCGAGGACC -3' (SEQ ID NO: 46), used to generate VEGF-C NHis (this construct encodes a 
polypeptide with a 6xHis tag fused to the N-terminus of the secreted precursor (amino acid 32 of SEQ ID NO: 33)). 

5 [0171] Some of the foregoing VEGF-C variant constructs were further modified to obtain additional constructs. For 
example, VEGF-C R102G in pALTER (Promega) and oligonucleotide 5 ' - GTATTATA ATGTCCTC C AC C A A ATTTTATAG 
-3' (SEQ ID NO: 47) were used to generate VEGF-C 4G, which encodes a polypeptide with four point mutations: R1 02G, 
A11 0G, A1 11 G, and A1 1 2G (alanines mutated to glycines at positions 1 1 0-1 1 2 (S EQ I D NO: 33). These four mutations 
are adjacent to predicted sites of cleavage of VEGF-C expressed in PC-3 and recombinantly expressed in 293 EBNA 

10 cells. 

[0172] Another construct was created using VEGF-C AN and oligonucleotide 5'-GTTCGCTGCCTGACACTGTGG- 
TAGTGTTGCTGGC GGCCGCTAGTGATGGTGATGGTGATGAATAATGGAATGAACTTGTCTGTAAACATCC AG -3' 
(SEQ ID NO: 48) to generate VEGF-C ANACHis. This construct encodes a polypeptide with a deleted N-terminal. 
propeptide (amino acids 32-102); a deleted C-terminal propeptide (amino acids 226-419 of SEQ ID NO: 33); and an 

15 added 6xHis tag at the C-terminus. 

[0173] All constructs were further digested with Hind\\ \ and Not\, subcloned into Hind\\\/Not\ digested pREP7 vector, 
and used to transfect 293 EBNA cells. About 48 hours after transfection, the cells were either metabolically labelled 
with Pro-mix™ as described above, or starved in serum-free medium for 2 days. Media were then collected and used 
in subsequent experiments. As can be seen from Figs. 27A-B, wild type (wt) VEGF-C, VEGF-C NHis and VEGF-C 

20 ANACHis were expressed to similar levels in 293 EBNA cells. At the same time, expression of the VEGF-C 4G polypep- 
tide was considerably lower, possibly due to the changed conformation and decreased stability of the translated product. 
However, all the above VEGF-C variants were secreted from the cells (compare Figs. 27A and 27B). The conditioned 
media from the transfected and starved cells were concentrated 5-fold and used to assess their ability to stimulate 
tyrosine phosphorylation of Flt4 (VEGFR-3) expressed in NIH 3T3 cells and KDR (VEGFR-2) expressed in PAE cells. 

25 [0174] Figs. 28A-B show that wild type (wt) VEGF-C, as well as all three mutant polypeptides, stimulate tyrosine 
phosphorylation of VEGFR-3. The most prominent stimulation is by the short mature VEGF-C ANACHis. This mutant, 
as well as VEGF-C NHis, also stimulated tyrosine phosphorylation of VEGFR-2. Thus, despite the fact that a major 
component of secreted recombinant VEGF-C is a dimer of 32/29 kD, the active part of VEGF-C responsible for its 
binding to VEGFR-3 and VEGFR-2 is localized between amino acids 102 and 226 (SEQ ID NO: 33) of the VEGF-C 

30 precursor. Analysis and comparison of binding properties and biological activities of these VEGF-C proteins and var- 
iants, using assays such as those described herein, will provide data concerning the significance of the observed major 
32/29 kD and 21-23 kD VEGF-C processed forms. The data indicate that constructs encoding amino acid residues 
103-225 of the VEGF-C precursor (SEQ ID NO: 33) generate a recombinant ligand that is functional for both VEGFR- 
3 and VEGFR-2. 

35 [0175] The data from this and preceding examples demonstrate that numerous fragments of the VEGF-C polypeptide 
retain biological activity. A naturally occurring VEGF-C polypeptide spanning amino acids 103-226 (or 1 03-227) of SEQ 
ID NO: 33, produced by a natural processing cleavage defining the C-terminus, has been shown to be active. Example 
27 demonstrates that a fragment with residues 104-213 of SEQ ID NO: 33 retains biological activity. 
[0176] In addition, data from Example 21 demonstrates that a VEGF-C polypeptide having its amino terminus at 

40 position 112 of SEQ ID NO: 33 retains activity. Additional experiments have shown that a fragment lacking residues 
1-112 of SEQ ID NO: 33 retains biological activity. 

[0177] In a related experiment, a stop codon was substituted for the lysine at position 214 of SEQ ID NO: 33 (SEQ 
ID NO: 32, nucleotides 991-993). The resulting recombinant polypeptide still was capable of inducing Flt4 autophos- 
phorylation, indicating that a polypeptide spanning amino acid residues 113-213 of SEQ ID NO: 33 is biologically active. 

45 [01 78] Sequence comparisons of members of the VEGF family of polypeptides provides an indication that still smaller 
fragments of the polypeptide depicted in SEQ ID NO: 33 will retain biological activity. In particular, eight highly conserved 
cysteine residues of the VEGF family of polypeptides define a region from residues 131 - 211 of SEQ ID NO: 33 (see 
Figure 31 ) of evolutionary signficance; therefore, a polypeptide spanning from about residue 1 31 to about residue 211 
is expected to retain VEGF-C biological activity. In fact, a polypeptide which retains the conserved motif RCXXCC (e. 

50 g., a polypeptide comprising from about residue 161 to about residue 211 of SEQ ID NO: 33 is postulated to retain 
VEGF-C biological activity. To maintain native conformation of these fragments, it may be preferred to retain about 1 -2 
additional amino acids at the carboxy-terminus and 1-2 or more amino acids at the amino terminus. 
[0179] Beyond the preceding considerations, evidence exists that smaller fragments and/or fragment variants which 
lack the conserved cysteines nonetheless will retain VEGF-C biological activity. 
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Example 29 

Expression of human VEGF-C under the human K14 keratin promoter in transgenic mice induces abundant 
growth of lymphatic vessels in the skin 

5 

[0180] The Flt4 receptor tyrosine kinase is relatively specifically expressed in the endothelia of lymphatic vessels. 
Kaipainen etai., Proc. Natl . Acad. Sci. (USA), 92. 3566-3570 (1995). Furthermore, the VEGF-C growth factor stimu- 
lates the Flt4 receptor, showing less activity towards the KDR receptor of blood vessels (Joukov et af., EMBO J., 15: 
290-298 (1996); See Example 26). 

10 [0181] Experiments were conducted in transgenic mice to analyze the specific effects of VEGF-C overexpression in 
tissues. The human K14 keratin promoter is active in the basal cells of stratified squamous epithelia (Vassar et al., 
Proc. Natl. Acad. Sci. (USA), 86:1563-1567 (1 989)) and was used as the expression control element in the recombinant 
VEGF-C transgene. The vector containing the K1 4 keratin promoter is described in Vassar et al., Genes Dev., 5:71 4-727 
(1991) and Nelson etai, J. Cell Biol. 97:244-251 (1983). 

15 [0182] The recombinant VEGF-C transgene was constructed using the human full length VEGF-C cDNA (GenBank 
Acc. No. X9421 6). This sequence was excised from a pCI-neo vector (Promega) with Xho\/Not\, and the resulting 2027 
base pair fragment containing the open reading frame and stop codon (nucleotides 352-1611 of SEQ ID NO: 32) was 
isolated. The isolated fragment was then subjected to an end-filling reaction using the Klenow fragment of DN A polymer- 
ase. The blunted fragment was then ligated to a similarly opened BamHI restriction site in the K1 4 vector. The resulting 

20 construct contained the EcoRI site derived from the polylinker of the pCI-neo vector. This EcoRI site was removed 
using standard techniques (a Klenow-mediated fill-in reaction following partial digestion of the recombinant intermediate 
with EcoRI) to facilitate the subsequent excision of the DNA fragment to be injected into fertilized mouse oocytes. The 
resulting clone, designated K14-VEGF-C, is illustrated in Fig. 20. 

[0183] The EcoRI-H/ndlll fragment from clone K14 VEGF-C containing the K14 promoter, VEGF-C cDNA, and K14 
25 polyadenylation signal was isolated and injected into fertilized oocytes of the FVB-NIH mouse strain. The injected 
zygotes were transplanted to oviducts of pseudopregnant C57BL/6 x DBA/2J hybrid mice. The resulting founder mice 
were analyzed for the presence of the transgene by polymerase chain reaction of tail DNA using the primers: 5'-CAT- 
GTACGAACCGCCAG-3' (SEQ ID NO: 49) and 5'-AATGACCAGAGAGAGGCGAG-3' (SEQ ID NO: 50). In addition, 
the tail DNAs were subjected to EcoRV digestion and subsequent Southern analysis using the EcoRI-H/ndlll fragment 
30 injected into the mice. Out of 8 pups analyzed at 3 weeks of age, 2 were positive, having approximately 40-50 copies 
and 4-6 copies of the transgene in their respective genomes. 

[0184] The mouse with the high copy number transgene was small, developed more slowly than its litter mates and 
had difficulty eating (i.e., suckling). Further examination showed a swollen, red snout and poor fur. Although fed with 
a special liquid diet, it suffered from edema of the upper respiratory and digestive tracts after feeding and had breathing 
35 difficulties. This mouse died eight weeks after birth and was immediately processed for histology, immunohistochem- 
istry, and In situ hybridization. 

[0185] Histological examination showed that in comparison to the skin of littermates, the dorsal dermis of 
K14-VEGF-C transgenic mice was atrophic and connective tissue was replaced by large lacunae devoid of red cells, 
but lined with a thin endothelial layer (white arrows in Figs. 29A-D). These distended vessel-like structures resembled 

40 those seen in human lymphangiomas. The number of skin adnexal organs and hair follicles were reduced. In the snout 
region, an increased number of vessels was also seen. Therefore, VEGF-C overexpression in the basal epidermis is 
capable of promoting the growth of extensive vessel structure in the underlying skin, including large vessel lacunae. 
The endothelial cells surrounding these lacunae contained abundant Flt4 mRNA in in situ hybridization (see Examples 
23 and 30 for methodology). The vessel morphology indicates that VEGF-C stimulates the growth of vessels having 

45 features of lymphatic vessels. The other K14-VEGF-C transgenic mouse had a similar skin histopathology. 

[0186] The foregoing in vivo data indicates utilities for both (i) VEGF-C polypeptides and polypeptide variants having 
VEGF-C biological activity, and (ii) anti-VEGF-C antibodies and VEGF-C antagonists that inhibit VEGF-C activity (e. 
g., by binding VEGF-C or interfering with VEGF-C/receptor interactions. For example, the data indicates a therapeutic 
utility for VEGF-C polypeptides in patients wherein growth of lymphatic tissue may be desirable (e.g., in patients fol- 

50 lowing breast cancer or other surgery where lymphatic tissue has been removed and where lymphatic drainage has 
therefore been compromised, resulting in swelling; or in patients suffering from elephantiasis). The data indicates a 
therapeutic utility for anti-VEGF-C antibody substances and VEGF-C antagonists for conditions wherein growth-inhi- 
bition of lymphatic tissue may be desirable (e.g., treatment of lymphangiomas) . 
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Example 30 

Expression of VEGF-C and Flt4 in the Developing Mouse 

5 [0187] Embryos from a 1 6-day post-coitus pregnant mouse were prepared and fixed in 4% paraformaldehyde (PFA), 
embedded in paraffin, and sectioned at 6 jum. The sections were placed on silanated microscope slides and treated 
with xylene, rehydrated, fixed for 20 minutes in 4% PFA, treated with proteinase K (7mg/ml; Merck, Darmstadt, Ger- 
many) for 5 minutes at room temperature, again fixed in 4% PFA and treated with acetic anhydride, dehydrated in 
solutions with increasing ethanol concentrations, dried and used for in situ hybridization. 

10 [0188] In situ hybridization of sections was performed as described (Vastrikef a/., J. Cell Biol., 728:1197-1208(1995)). 
A mouse VEGF-C antisense RNA probe was generated from linearized pBluescript II SK+ plasmid (Stratagene Inc.), 
containing a fragment corresponding to nucleotides 499-979 of mouse VEGF-C cDNA, where the noncoding region 
and the BR3P repeat were removed by Exonuclease III treatment. The fragment had been cloned into the EcoRI and 
Hindlll sites of pBluescript II SK + . Radiolabeled RNA was synthesized using T7 RNA Polymerase and [ 35 S]-UTP (Am- 

15 ersham, Little Chalfont, UK) . About two million cpm of the VEGF-C probe was applied per slide. After an overnight 
hybridization, the slides were washed first in 2x SSC and 20-30 mM DDT for 1 hour at 50°C. Treatment continued with 
a high stringency wash, 4x SSC and 20 mM DTT and 50% deionized formamide for 30 minutes at 65°C followed by 
RNase A treatment (20 |ig/ml) for 30 minutes at 37°C. The high stringency wash was repeated for 45 minutes. Finally, 
the slides were dehydrated and dried for 30 minutes at room temperature. The slides were dipped into photography 

20 emulsion and exposed for 4 weeks. Slides were developed using Kodak D-16 developer, counterstained with hema- 
toxylin and mounted with Permount (FisherCbemical). 

[0189] For in situ hybridizations of Flt4 sequences, a mouse Flt4 cDNA fragment covering bp 1 -1 92 of the published 
sequence (Finnerty etai, Oncogene, 8:2293-2298 (1993)) was used, and the above-described protocol was followed, 
with the following exceptions. Approximately one million cpm of the Flt4 probe were applied to each slide. The stringent 

25 washes following hybridization were performed in 1x SSC and 30 mM DTT for 105 minutes. 

[0190] The figure shows photomicrographs of the hybridized sections in dark field microscopy (Figs. 36A-C) and 
light field microscopy (Fig. 36D). Magnifications used for photography were 4x for Figs. 36A-B and 1 0x for Figs. 36C-D. 
The transverse sections shown are from the cephalic region and the area shown for VEGF-C and FLT4 are about 14 
sections apart, Flt4 being more cranial ly located in the embryo. In Fig. 36A (Flt4 probe), the developing nasopharyngeal 

30 cavity is in the midline in the upper, posterior part; in the anterior part of Fig. 36A is the snout with emerging fibrissal 
follicles and, in the midline, the forming nasal cavity. On both sides, the retinal pigment gives a false positive signal in 
dark field microscopy. The most prominently Flt4-hybridizing structures appear to correspond to the developing lym- 
phatic and venous endothelium. Note that a plexus-like endothelial vascular structure surrounds the developing na- 
sopharyngeal mucous membrane. In Fig. 36B, the most prominent signal is obtained from the posterior part of the 

35 developing nasal conchae, which in higher magnification (Figs. 36C-D) show the epithelium surrounding loose con- 
nective tissue/forming cartilage. This structure gives a strong in situ hybridization signal for VEGF-C. Also in Fig. 3.6B, 
more weakly hybridizing areas can be seen around the snout, although this signal is much more homogeneous in 
appearance. Thus, the expression of VEGF-C is strikingly high in the developing nasal conchae. 
[0191] The conchae are surrounded with a rich vascular plexus, important in nasal physiology as a source for the 

40 mucus produced by the epithelial cells and for warming inhaled air. It is suggested that VEGF-C is important in the 
formation of the concheal venous plexus at the mucous membranes, and that it may also regulate the permeability of 
the vessels needed for the secretion of nasal mucus. Possibly, VEGF-C and its derivatives, and antagonists, could be 
used in the regulation of the turgor of the conchal tissue and mucous membranes and therefore the diameter of the 
upper respiratory tract, as well as the quantity and quality of mucus produced. These factors are of great clinical sig- 

45 nificance in inflammatory (including allergic) and infectious diseases of the upper respiratory tract. Accordingly, the 
invention contemplates the use of the materials of the invention, in methods of diagnosing and treating inflammatory 
and infectious conditions affecting the upper respiratory tract, including nasal structures. 

Example 31 

50 

Characterization of the exon-intron organization of the human VEGF-C gene 

[0192] Two genomic DNA clones covering exons 1 , 2, and 3 of the human VEGF-C gene were isolated from a human 
genomic DNA library using VEGF-C cDNA fragments as probes. In particular, a human genomic library in bacteriophage 
55 EMBL-3 lambda (Clontech) was screened using a PCR-generated fragment corresponding to nucleotides 629-746 of 
the human VEGF-C cDNA (SEQ ID NO: 32). One positive clone, designated "lambda 3," was identified, and the insert 
was subcloned as a 14 kb Xho\ fragment into the pBSK II vector (Stratagene). The genomic library also was screened 
with a labeled 130 bp Not\-Sac\ fragment from the 5'-noncoding region of the VEGF-C cDNA (the Not\ site is in the 
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polylinker of the cloning vector; the Sad site corresponds to nucleotides 92-97 of SEQ ID NO: 32). Two positive clones, 
designated "lambda 5" and "lambda 8," were obtained. Restriction mapping analysis showed that clone lambda 3 
contains exons 2 and 3, while clone lambda 5 contains exon 1 and the putative promoter region. 
[0193] Three genomic fragments containing exons 4, 5, 6 and 7 were subcloned from a genomic VEGF-C P1 plasmid 
5 clone. In particular, purified DNA from a genomic P1 plasmid clone 7660 (Paavonen etal., Circulation, 93: 1079-1082 
(1996)) was used. EcoRI fragments of the P1 insert DNA were ligated into pBSK II vector. Subclones of clone 7660 
which contained human VEGF-C cDNA homologous sequences were identified by colony hybridization, using the full- 
length VEGF-C cDNA as a probe. Three different genomic fragments were identified and isolated, which contained 
the remaining exons 4-7. 

w [0194] To determine the genomic organization, the clones were mapped using restriction endonuclease cleavage. 
Also, the coding regions and exon-intron junctions were partially sequenced. The result of this analysis is depicted in 
Figures 11 and 17. The sequences of all intron-exon boundaries (Fig. 17, SEQ ID NOs: 57-68) conformed to the 
consensus splicing signals (Mount, Nuci. Acids Res., 10: 459-472 (1982)). The length of the intron between exon 5 
and 6 was determined directly by nucleotide sequencing and found to be 301 bp. The length of the intron between 

15 exons 2 and 3 was determined by restriction mapping and Southern hybridization and was found to be about 1 .6 kb. 
Each of the other introns was over 10 kb in length. 

[0195] A similar analysis was performed for the murine genomic VEGF-C gene. The sequences of murine VEGF-C 
intron-exon boundaries are depicted in Figure 17 and SEQ ID NOs: 69-80. 

[0196] The restriction mapping and sequencing data indicated that the signal sequence and the first residues of the 
20 N-terminal propeptide are encoded by exon 1 . The second exon encodes the carboxy-terminal portion of the N-terminal 
propeptide and the amino terminus of the VEGF homology domain. The most conserved sequences of the VEGF 
homology domain are distributed in exons 3 (containing 6 conserved cysteine residues) and 4 (containing 2 cys resi- 
dues). The remaining exons encode cysteine-rich motifs of the type C-6X-C-10X-CRC (exons 5 and 7) and a fivefold 
repeated motif of type C-6X-B-3X-C-C-C, which is typical of a silk protein. 
25 [0197] To further characterize the VEGF-C gene promoter, the lambda 5 clone was further analyzed. Restriction 
mapping of this clone using a combination of single- and double-digestions and Southern hybridizations indicated that 
it includes: (1) an approximately 5 kb region upstream of the putative initiator ATG codon, (2) exon 1 , and (3) part of 
intron I of the VEGF-C gene. 

[0198] A 3.7 kb Xba I fragment of clone lambda 5, containing exon 1 and 5' and 3' flanking sequences, was subcloned 
30 and further analyzed. As reported previously, a major VEGF-C mRNA band migrates at a position of about 2.4 kb. 
Calculating from the VEGF-C coding sequence of 1 257 bp and a 391 bp 3' noncoding sequence plus a polyA sequence 
of about 50-200 bp, the mRNA start site should be located about 550-700 bp upstream of the translation initiation codon. 
[0199] To further characterize the promoter of the human VEGF-C gene, a genomic clone encompassing about 1 .4 
kb upstream of the translation initiation site was isolated, and the 5' noncoding cDNA sequence and putative promoter 
35 region were sequenced. The sequence obtained is set forth in SEQ ID NO: 54. Similar to what has been observed with 
the VEGF gene, the VEGF-C promoter is rich in G and C residues and lacks consensus TATA and CCAAT sequences. 
Instead, it has numerous putative binding sites for Sp1, a ubiquitous nuclear protein that can initiate transcription of 
TATA-less genes. See Pugh and Tjian, Genes and Dev., 5:105-119 (1991). In addition, sequences upstream of the 
VEGF-C translation start site were found to contain frequent consensus binding sites for the AP-2 factor. This suggests 
40 that the cAMP-dependent protein kinase and protein kinase C, as activators of AP-2 transcription factor [Curran and 
Franza, Cell, 55:395-397 (1988)], mediate VEGF-C transcriptional regulation. 

[0200] The VEGF-C gene is abundantly expressed in adult human tissues, such as heart, placenta, ovary and small 
intestine, and is induced by a variety of factors. Indeed, several potential binding sites for regulators of tissue-specific 
gene expression, like NFkB and GATA, are located in the distal part of the VEGF-C promoter. For example, NFkB is 
45 known to regulate the expression of tissue factor in endothelial cells. Also, transcription factors of the GATA family are 
thought to regulate cell-type specific gene expression. 

[0201] Unlike VEGF, the VEGF-C gene does not contain a binding site for the hypoxia-inducible factor, HIF-1 (Levy 
et ai, J. Biol. Chem., 270: 13333-13340 (1995)). This finding suggests that if the VEGF-C mRNA is regulated by 
hypoxia, the mechanism would be based mainly on the regulation of mRNA stability. In this regard, numerous studies 

50 have shown that the major control point for the hypoxic induction of the VEGF gene is the regulation of the steady- 
state level of mRNA. See Levy etal., J. Biol. Chem., 271: 2746-2753 (1 996). The relative rate of VEGF mRNA stability 
and decay is considered to be determined by the presence of specific sequence motifs in its 3' untranslated region 
(UTR) , which have been demonstrated to regulate mRNA stability (Chen and Shyu, Mol. Cell Biol., 14: 8471-8482 
(1 994)). The 3'-UTR of the VEGF-C gene also contains a putative motif of this type (TTATTT), at positions 1 873-1 878 

55 of SEQ ID NO: 32. 
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Example 32 

Identification of a VEGF-C splice variant 

5 [0202] As reported in Example 16, a major 2.4 kb VEGF-C mRNA and smaller amounts of a 2.0 kb mRNA are 
observable. To clarify the origin of these RNAs, several additional VEGF-C cDNAs were isolated and characterized. 
A human fibrosarcoma cDNA library from HT1 080 cells in the lambda gt1 1 vector (Clontech, product #HL1 048b) was 
screened using a 153 bp human VEGF-C cDNA fragment as a probe as described in Example 10. See also Joukov 
etal., EMBO J., 75:290-298 (1996). Nine positive clones were picked and analyzed by PCR amplification using oligo- 

10 nucleotides 5'-CACGGCTTATGCAAGCAAAG-3' (SEQ ID NO: 55) and 5'-AACACAGTTTTCCATAATAG-3' (SEQ ID 
NO: 56) These oligonucleotides were selected to amplify the portion of the VEGF-C cDNA corresponding to nucleotides 
495-1661 of SEQ ID NO: 32. PCR was performed using an annealing temperature of 55°C and 25 cycles. 
[0203] The resultant PCR products were electrophoresed on agarose gels. Five clones out of the nine analyzed 
generated PCR fragments of the expected length of 1147 base pairs, whereas one was slightly shorter. The shorter 

15 fragment and one of the fragments of expected length were cloned into the pCRTMII vector (Invitrogen) and analyzed 
by sequencing. The sequence revealed that the shorter PCR fragment had a deletion of 1 53 base pairs, corresponding 
to nucleotides 904 to 1055 of SEQ ID NO: 32. These deleted bases correspond to exon 4 of the human and mouse 
VEGF-C genes, schematically depicted in Fig. 17. Deletion of exon 4 results in aframeshift, which in turn results in a 
C-terminal truncation of the full-length VEGF-C precursor, with fifteen amino acid residues translated from exon 5 in 

20 a different frame than the frame used to express the full-length protein. Thus, the C-terminal amino acid sequence of 
the resulting truncated polypeptide would be -Leu (1 81)-Ser-Lys-Thr-Val-Ser-Gly-Ser-Glu-Gln-Asp-Leu-Pro-His-Glu- 
Leu-His-Val-Glu(199) (SEQ ID NO: 81). The VEGF-C variant encoded by this splice variant would not contain the C- 
terminal cleavage site of the VEGF-C precursor. Thus, a putative alternatively spliced RNA form lacking conserved 
exon 4 was identified in HT-1080 fibrosarcoma cells and this form is predicted to encode a protein of 199 amino acid 

25 residues, which could be an antagonist of VEGF-C. 

Example 33 

VEGF-C is similarly processed in different cell cultures in vitro 

30 

[0204] To study whether VEGF-C is similarly processed in different cell types, 293 EBNA cells, COS-1 cells and HT- 
1 080 cells were transfected with wild type human VEGF-C cDNA and labelled with Pro-Mix™ as described in Example 
22. The conditioned media from the cultures were collected and subjected to immunoprecipitation using antiserum 882 
(described in Example 21, recognizing a peptide corresponding to amino acids 104-120 of SEQ ID NO: 33). The im- 

35 munoprecipitated polypeptides were separated via SDS-PAGE, and detected via autoradiography. The major form of 
secreted recombinant VEGF-C observed from all cell lines tested is a 29/32 kD doublet. These two polypeptides are 
bound to each other by disulfide bonds, as described in Example 22. A less prominent band of approximately 21 kD 
also was detected in the culture media. Additionally, a non-processed VEGF-C precursor of 63 kDa was observed. 
This form was more prominent in the COS-1 cells, suggesting that proteolytic processing of VEGF-C in COS cells is 

40 less efficient than in 293 EBNA cells. Endogenous VEGF-C (in non-transfected cells) was not detectable under these 
experimental conditions in the HT-1080 cells, but was readily detected in the conditioned medium of the PC-3 cells. 
Analysis of the subunit polypeptide sizes and ratios in PC-3 cells and 293 EBNA cells revealed strikingly similar results: 
the most prominent form was a doublet of 29/32 kDa and a less prominent form the 21 kD polypeptide. The 21 kD form 
produced by 293 EBNA cells was not recognized by the 882 antibody in the Western blot, although it is recognized 

45 when the same antibody is used for immunoprecipitation (see data in previous examples). As reported in Example 21 , 
cleavage of the 32 kD form in 293 EBNA cells occurs between amino acid residues 111 and 112 (SEQ ID NO: 33), 
downstream of the cleavage site in PC-3 cells (between residues 102 and 103). Therefore, the 21 kD form produced 
in 293 EBNA cells does not contain the complete N-terminal peptide used to generate antiserum 882. 
[0205] In a related experiment, PC-3 cells were cultured in serum-free medium for varying periods of time (1 - 8 

50 days) prior to isolation of the conditioned medium. The conditioned medium was concentrated using a Centricon device 
(Amicon, Beverly, USA) and subjected to Western blotting analysis using antiserum 882. After one day of culturing, a 
prominent 32 kD band was detected. Increasing amounts of a 21 -23 kD form were detected in the conditioned media 
from 4 day and 8 day cultures. The diffuse nature of this polypeptide band, which is simply called the 23 kD polypeptide 
in example 5 and several subsequent examples is most likely due to a heterogenous and variable amount of glyco- 

55 sylation. These results indicate that, initially, the cells secrete a 32 kD polypeptide, which is further processed or cleaved 
in the medium to yield the 21-23 kD form. The microheterogeneity of this polypeptide band would then arise from the 
variable glycosylation degree and, from microheterogeneity of the processing cleavage sites, such as obtained for the 
amino terminus in PC-3 and 293 EBNA cell cultures. The carboxyl terminal cleavage site could also vary, examples of 
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possible cleavage sites would be between residues 225-226, 226-227 and 227-228 as well as between residues 
216-217. Taken together, these data suggest the possibility that secreted cellular protease(s) are responsible for the 
generation of the 21-23 kD form of VEGF-C from the 32 kD polypeptide. Such proteases could be used in vitro to 
cleave VEGF-C precursor proteins in solution during the production of VEGF-C, or used in cell culture and in vivo to 
5 release biologically active VEGF-C. 

Example 34 

Differential binding of VEGF-C forms by the extracellular domains of VEGFR-3 and VEGFR-2 

10 

[0206] In two parallel experiments, 293 EBNA cells were transfected with a construct encoding recombinant wild 
type VEGF-C or a construct encoding VEGF-C DNDCHis (Example 28) and about 48 hours after transfection, meta- 
bolically labelled with Pro-Mix™ as described in previous examples. The media were collected from mock-transfected 
and transfected cells and used for receptor binding analyses. 

15 [0207] Receptor binding was carried out in binding buffer (PBS, 0.5% BSA, 0.02% Tween 20, 1 microgram/ml heparin) 
containing approximately 0.2 microgram of either (a) a fusion protein comprising a VEGFR-3 extracellular domain fused 
to an immunoglobulin sequence (VEGFR-3-lg) or (b) a fusion protein comprising VEGFR-2 extracellular domain fused 
to an alkaline phosphatase sequence (VEGF-R-2-AP; Cao et ai, J. Biol. Chem. 277:3154-62 (1996)). As a control, 
similar aliquots of the 293 EBNA conditioned media were mixed with 2 jliI of anti-VEGF-C antiserum (VEGF-C IP). 

20 [0208] After incubation for 2 hours at room temperature, anti-VEGF-C antibodies and VEGFR-3-lg protein were 
adsorbed to protein A-sepharose (PAS) and VEGFR-2-AP was immunoprecipitated using anti-AP monoclonal antibod- 
ies (Medix Biotech, Genzyme Diagnostics, San Carlos, CA, USA) and protein G-sepharose. Complexes containing 
VEGF-C bound to VEGFR-3-lg or VEGFR-2-AP were washed three times in binding buffer, twice in 20 mM Tris-HCI 
(pH 7.4) and VEGF-C immunoprecipitates were washed three times in RIPA buffer and twice in 20 mM tris-HCI (pH 

25 7.4) and analyzed via SDS-PAGE under reducing and nonreducing conditions. As a control, the same media were 
precipitated with anti-AP and protein G-sepharose (PGS) or with PAS to control for possible nonspecific adsorption. 
[0209] These experiments revealed that VEGFR-3 bound to both the 32/29 kD and 21 -23 kD forms of recombinant 
VEGF-C, whereas VEGFR-2 bound preferentially to the 21 -23 kD component from the conditioned media. In addition, 
small amounts of 63 kD and 52 kD VEGF-C forms were observed binding with VEGFR-3. Further analysis under 

30 nonreducing conditions indicates that a great proportion of the 21-23 kD VEGF-C bound to either receptor does not 
contain interchain disulfide bonds. These findings reinforce the results that VEGF-C binds VEGFR-2. This data sug- 
gests a utility for recombinant forms of VEGF-C which are active towards VEGFR-3 only or which are active towards 
both VEGFR-3 and VEGFR-2. On the other hand, these results, together with the results in Example 28, do not eliminate 
the possibility that the 32/29 kD dimer binds VEGFR-3 but does not activate it. The failure of the 32/29 kD dimer to 

35 activate VEGFR-3 could explain the finding that conditioned medium from the N-His VEGF-C transfected cells induced 
a less prominent tyrosine phosphorylation of VEGFR-3 than medium from VEGF-C DNDCHis transfected cells, even 
though expression of the former polypeptide was much higher (see Figs. 27 and 28). Stable VEGF-C polypeptide 
variants that bind to a VEGF-C receptor but fail to activate the receptor are useful as VEGF-C antagonists. 
[021 0] Deposit of Biological Materials: Plasmid FLT4-L has been deposited with the American Type Culture Collection 

40 (ATCC), 12301 Parklawn Dr., Rockville MD 20952 (USA), pursuant to the provisions of the Budapest Treaty, and has 
been assigned a deposit date of 24 July 1 995 and ATCC accession number 97231 . 

SEQUENCE LISTING 

45 [0211] 

(1) GENERAL INFORMATION: 

(i) APPLICANT: Helsinki University Licensing Ltd Oy 

50 

(ii) TITLE OF INVENTION: Receptor Ligand VEGF-C 

(iii) NUMBER OF SEQUENCES: 81 
55 (jy) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Oy Jalo Ant-Wuorinen Ab 

(B) STREET: Iso Roobertinkatu 4-6 A 
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(C) CITY: Helsinki 

(E) COUNTRY: Finland 

(F) ZIP: 00120 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.30 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/671,573 

(B) FILING DATE: 28-JUN-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/601,132 

(B) FILING DATE: 14-FEB-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/585,895 

(B) FILING DATE: 12-JAN-1996 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: 08/510,133 

(B) FILING DATE: 01-AUG-1995 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Karvinen, Leena 

(C) REFERENCE/DOCKET NUMBER: 28999 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: +358 0 648 606 

(B) TELEFAX: +358 0 640 575 

(C) TELEX: 123505 JALO Fl 

(2) INFORMATION FOR SEQ ID NO:1 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:1: 



(2) INFORMATION FOR SEQ ID NO:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 70 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

ACATG CATGC CACCATGCAG CGGGGCGCCG CGCTCTGCCT GCGACTGT6G CTCTGCCTGG 
GACTCCTGGA 

(2) INFORMATION FOR SEQ ID NO:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:3: 

ACATGCATGC CCCGCCGGTC ATCC 

(2) INFORMATION FOR SEQ ID NO:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:4: 

CGGAATTCCC CATGACCCCA AC 

(2) INFORMATION FOR SEQ ID NO:5: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 33 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:5: 



CCATCGATGG ATCCTACCTG AAGCCGCTTT CTT 



(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:6: 



ATTTAGGTGA CACTATA 

(2) INFORMATION FOR SEQ ID NO: 7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 7 : 



CCATCGATGG ATCCCGATGC TGCTTAGTAG CTGT 

(2) INFORMATION FOR SEQ ID NO:8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 40 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:8: 
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Pro Met Tfar Pro Thr Thr Tyr Lys Gly Ser Val Asp Asn Gin Thr Asp 
15 10 15 

Ser Gly Met Val Leu Ala Ser Glu Glu Phe Glu Gin lie Glu Ser Arg 

5 20 25 30 

His Arg Gin Glu Ser Gly Phe Arg 
35 40 

10 (2) INFORMATION FOR SEQ ID NO:9 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 
15 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

20 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:9: 

CTGGAGTCGA CTTG GCGGAC T 

25 

(2) INFORMATION FOR SEQ ID NO:10: 
(i) SEQUENCE CHARACTERISTICS: 

30 (A) LENGTH: 60 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

35 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:10: 



40 CGCGGATCCC TAGTGATGOT GATGGTGATG TCTACCTTCG ATCATGCTGC CCTTATCCTC 60 

(2) INFORMATION FOR SEQ ID NO: 11 : 

(i) SEQUENCE CHARACTERISTICS: 

45 

(A) LENGTH: 34 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:11: 

55 

CCCAAGCTTG GATCCAAGTG GCTACTCCAT GACC 34 



(2) INFORMATION FOR SEQ ID NO:12: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:12: 

GTTG CCTGTG ATGTG CACCA 

(2) INFORMATION FOR SEQ ID NO:13: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:13: 



xaa Glu Glu Thr lie Lys Phe Ala Ala Ala His Tyr Asn Thr Glu He 
1 5 10 15 

L«eu Lys 



(2) INFORMATION FOR SEQ ID NO:14: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:14: 



GCAGARGARA CNATHAA 

(2) INFORMATION FOR SEQ ID NO:15: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:15: 

Glu Glu Tiur lie Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:16: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:16: 

GCAYTTNARD ATYTCNGT 

(2) INFORMATION FOR SEQ ID NO:17: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 5 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:17: 

Thr Glu lie Leu Lys 
1 5 

(2) INFORMATION FOR SEQ ID NO:18: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:18: 

ATTCGCTGCA GCACACTACA AC 

(2) INFORMATION FOR SEQ ID NO:19: 
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(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:19: 

TCNGTGTTOT AGTGTGCTG 

(2) INFORMATION FOR SEQ ID NO:20: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 7 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:20: 

Ala Ala His Tyr Asn Ttir Glu 

1 5 

(2) INFORMATION FOR SEQ ID NO: 21 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:21: 

TAATACGACT CACTATAGGG 

(2) INFORMATION FOR SEQ ID NO:22: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: cDNA 
5 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:22: 

GTTGTAGTGT GCTGCAGCGA A TTT 24 

10 (2) INFORMATION FOR SEQ ID NO: 23 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 8 amino acids 
15 (B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 



20 



25 



30 



35 



40 



(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:23: 

Lys Phe Ala Ala Ala His Tyr Asn 
1 5 

(2) INFORMATION FOR SEQ ID NO:24: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:24: 

TCACTATAGG GAGACCCAAG C 21 



(2) INFORMATION FOR SEQ ID NO:25: 
45 (j) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 219 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
50 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:25: 

55 
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TCACTATAGG GAGACCCAAG CTTGGTACCG AGCTCGGATC CACTAGTAAC GGCCGCCAGT 60 

GTGGTGGAAT TCGACGAACT CATGACTGTA CTCTACCCAG AATATTGGAA AATGTACAAG 120 

TGTCAGCTAA GGCAAGGAGG CTGGCAACAT AACAGAGAAC AGGCCAACCT CAACTCAAGO 180 

ACAGAAGAGA CTATAAAATT CGCTGCAGCA CACTACAAC 219 



(2) INFORMATION FOR SEQ ID NO:26: 

w 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

15 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

20 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:26: 



ACAGAGAACA GGCCAACC 16 



25 (2) INFORMATION FOR SEQ ID NO:27: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 base pairs 
30 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

35 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:27: 



TCTAGCATTT AGGTGACAC 19 

40 

(2) INFORMATION FOR SEQ In NO:28: 
(i) SEQUENCE CHARACTERISTICS: 

45 (A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

50 (ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:28: 



AAGAGACTAT AAAATTCGCT GCAGC 25 

(2) INFORMATION FOR SEQ In NO:29 



43 



EP 0 842 273 B1 



(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:29: 



CCCTCTAGAT GCATGCTCGA 



(2) INFORMATION FOR SEQ ID NO:30: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:30: 



GTTGTAGTGT GCTGCAGCGA ATTT 

(2) INFORMATION FOR SEQ ID NO:31 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:31: 



TCACTATAGG GAGACCCAAG C 

(2) INFORMATION FOR SEQ ID NO:32: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1997 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 
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(A) NAME/KEY: CDS 

(B) LOCATION: 352.. 1608 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:32: 



CCCGCCCCGC CTCTCCAAAA AGCTACACCG ACGCGGACCG CGGCGGCGTC CTCCCTCGCC SO 

CTCGCTTCAC CTCGCGGGCT CCGAATGCGG GGAGCTCGGA TGTCCGGTTT CCTGTGAGGC 120 

10 TTTTACCTGA CACCCGCCGC CTTTCCCCGG CACTGGCTGG GAGGGCGCCC TGCAAAGTTG 180 

GGAACGCGGA GCCCCGGACC CGCTCCCGCC GCCTCCGGCT CGCCCAGGGG GGGTCGCCGG 240 

GAGGAGCCCG GGGGAGAGGG ACCAGGAGGG GCCCGCGGCC TCGCAGGGGC GCCCGCGCCC 300 

15 CCACCCCTGC CCCCGCCAGC GGACCGGTCC CCCACCCCCG GTCCTTCCAC C ATG CAC 357 

Met His 
1 

TTG CTG GGC TTC TTC TCT GTG GCG TGT TCT CTG CTC GCC GCT GCG CTG 405 
Leu Leu Gly Phe Phe Ser Val Ala Cys Ser Leu Leu Ala Ala Ala Leu 

20 5 10 15 

CTC CCG GGT CCT CGC GAG GCG CCC GCC GCC GCC GCC GCC TTC GAG TCC 453 
Leu Pro Gly Pro Arg Glu Ala Pro Ala Ala Ala Ala Ala Phe Glu Ser 
20 25 30 

25 GGA CTC GAC CTC TCG GAC GCG GAG CCC GAC GCG GGC GAG GCC ACG GCT 501 

Gly Leu Asp Leu Ser Asp Ala Glu Pro Asp Ala Gly Glu Ala Thr Ala 
35 40 45 50 

TAT GCA AGC AAA GAT CTG GAG GAG CAG TTA CGG TCT GTG TCC AGT GTA 549 
Tyr Ala Ser Lye Asp Leu Glu Glu Gin Leu Arg Ser Val Ser Ser Val 
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GAT GAA CTC ATG ACT GTA CTC TAC CCA GAA TAT TGG AAA ATG TAC AAG 597 
Asp Glu Leu Met Thr Val Leu Tyr Pro Glu Tyr Trp Lys Met Tyr Lys 
70 75 " 80 

5 

TGT CAG CTA AGG AAA GGA GGC TGG CAA CAT AAC AGA GAA CAG GCC AAC 645 
Cys Gin Leu Arg Lys Gly Gly Trp Gin His Asn Arg Glu Gin Ala Asn 
B5 90 95 

CTC AAC TCA AGG ACA GAA GAG ACT ATA AAA TTT GCT GCA GCA CAT TAT 693 
Leu Asn Ser Arg Thr Glu Glu Thr He Lys Phe Ala Ala Ala His Tyr 
100 * 105 110 

AAT ACA GAG ATC TTG AAA AGT ATT GAT AAT GAG TGG AGA AAG ACT CAA 741 
Asn Thr Glu lie Leu Lys Ser He Asp Asn Glu Trp Arg Lys Thr Gin 
115 120 125 ~ 130 

15 TGC ATG CCA CGG GAG GTG TGT ATA GAT GTG GGG AAG GAG TTT GGA GTC 789 

Cys Met Pro Arg Glu Val Cys He Asp Val Gly Lys Glu Phe Gly Val 
135 140 145 

GCG ACA AAC ACC TTC TTT AAA CCT CCA TGT GTG TCC GTC TAC AGA TGT 837 
Ala Thr Asn Thr Phe Phe Lys Pro Pro Cys Val Ser Val Tyr Arg Cys 

20 150 155 160 

GGG GGT TGC TGC AAT AGT GAG GGG CTG CAG TGC ATG AAC ACC AGO ACG 885 
Gly Gly Cys Cys Asn Ser Glu Gly Leu Gin Cys Met Asn Thr Ser Thr 
165 170 175 

25 AGC TAC CTC AGC AAG ACG TTA TTT GAA ATT ACA GTG CCT CTC TCT CAA 933 

Ser Tyr Leu Ser Lys Thr Leu Phe Glu He Thr Val Pro Leu Ser Gin 
180 185 190 

GGC CCC AAA CCA GTA ACA ATC AGT TTT GCC AAT CAC ACT TCC TGC GGA 981 

Gly Pro Lys Pro Val Thr He Ser Phe Ala Asn His Thr Ser Cys Arg 
195 200 205 210 

30 

TGC ATG TCT AAA CTG GAT GTT TAC AGA CAA GTT CAT TCC ATT ATT AGA 1029 

Cys Met Ser Lys Leu Asp Val Tyr Arg Gin Val His Ser He Xle Arg 
215 220 225 

CGT TCC CTG CCA GCA ACA CTA CCA CAG TGT CAG GCA GCG AAC AAG ACC 1077 
35 Arg Ser Leu Pro Ala Thr Leu Pro Gin Cys Gin Ala Ala Asn l»ys Thr 

230 23S 240 

TGC CCC ACC AAT TAC ATG TGG AAT AAT CAC ATC TGC AGA TGC CTG GCT 1125 
Cys Pro Thr Asn Tyr Met Trp Asn Asn His He Cys Arg Cys Leu Ala 
245 250 255 

40 CAG GAA GAT TTT ATG TTT TCC TCG GAT GCT GGA GAT GAC TCA ACA GAT 1173 

Gin Glu Asp Phe Met Phe Ser Ser Asp Ala Gly Asp Asp Ser Thr Asp 
260 265 270 

GGA TTC CAT GAC ATC TGT GGA CCA AAC AAG GAG CTG GAT GAA GAG ACC 1221 
Gly Phe His Asp He Cys Gly Pro Asn Lys Glu Leu Asp Glu Glu Thr 
45 275 280 285 290 

TGT CAG TGT GTC TGC AGA GCG GGG CTT CGG CCT GCC AGC TGT GGA CCC 1269 
Cys Gla Cys Val Cys Arg Ala Gly Leu Arg Pro Ala Ser Cys Gly Pro 
295 300 305 

CAC AAA GAA CTA GAC AGA AAC TCA TGC CAG TGT GTC TGT AAA AAC AAA 1317 
Kis Lys Glu Leu Asp Arg Asn Ser Cys Gin Cys Val Cys Lys Asn Lys 
310 315 320 

CTC TTC CCC AGC CAA TGT GGG GCC AAC CGA GAA TTT GAT GAA AAC ACA 1365 
Leu Phe Pro Ser Gin Cys Gly Ala Asn Arg Glu Phe Asp Glu Asn Thr 
325 330 335 

55 
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TGC CAG TGT GTA TGT AAA AGA ACC TGC CCC AGA AAT CAA CCC CTA AAT 
Cys Gin Cys Val Cys Lys Arg Thr Cys Pro Arg Asn Gin Pro Leu Asn 
340 345 350 

CCT GGA AAA TGT GCC TGT GAA TGT ACA GAA AGT CCA CAG AAA TGC TTG 
Pro Gly Lys Cys Ala Cys Glu Cys Thr Glu Ser Pro Gin Lys Cys Leu 
355 360 365 370 

TTA AAA GGA AAG AAG TTC CAC CAC CAA ACA TGC AGC TGT TAC AGA CGG 
Leu Lys Gly Lys Lys Phe His His Gin Thr Cys Ser Cys Tyr Arg Arg 
375 380 385 

CCA TGT ACG AAC CGC CAG AAG GCT TGT GAG CCA GGA TTT TCA TAT AGT 
Pro Cys Thr Asn Arg Gin Lys Ala Cys Glu Pro Gly Phe Ser Tyr Ser 
390 395 400 

GAA GAA GTG TGT CGT TGT GTC CCT TCA TAT TGG AAA AGA CCA CAA ATG 
Glu Glu Val Cys Arg Cys Val Pro Ser Tyr Trp Lys Arg Pro Gin Met 
405 4X0 415 

AGC TAAGATTGTA CTGTTTTCCA GTTCATCGAT TTTCTATTAT GGAAAACTGT 
Ser 

GTTGCCACAG TAGAACTGTC TGTGAACAGA GAGACCCTTG TGGGTCCATG CTAACAAAGA 
CAAAAGTCTG TCTTTCCTGA ACCATGTGGA TAACTTTACA GAAATGGACT GGAGCTCATC 
TGCAAAAGGC CTCTTGTAAA GACTGGTTTT CTGCCAATGA CCAAACAGCC AAGATTTTCC 
TCTTGTGATT TCTTTAAAAG AATGACTATA TAATTTATTT CCACTAAAAA TATTGTTTCT 
GCATTCATTT TTATAGCAAC AACAATTGGT AAAACTCACT GTGATCAATA TTTTTATATC 
ATGCAAAATA TGTTTAAAAT AAAATGAAAA TTGTATTAT 



(2) INFORMATION FOR SEQ ID NO:33: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 419 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:33: 
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Met: His Leu Leu Gly Phe Phe Ser val Ala Cys Ser Leu Leu Ala Ala 
15 10 15 

Ala Leu Leu Pro Gly Pro Arg Glu Ala Pro Ala Ala Ala Ala Ala Phe 
20 25 30 

Glu Ser Gly Leu Asp Leu Ser Asp Ala Glu Pro Asp Ala Gly Glu Ala 
35 40 45 

Thr* Ala Tyr Ala Ser Lys Asp Leu Glu Glu Gin Leu Arg Ser Val Ser 
50 55 60 

Ser Val Asp Glu Leu Met Thr Val Leu Tyr Pro Glu Tyr Trp Lys Met 
65 70 75 80 

Tyr Lys Cys Glxx Leu Arg Lys Gly Gly Trp Gin His Asn Arg Glu Gin 

85 90 95 

Ala Asn Leu Asn Ser Arg Thr Glu Glu Thr lie Lys Pile Ala Ala Ala 
X0O 105 HO 

His Tyr Asn Thar Glu lie Leu Lys Ser He Asp Asn Glu Trp Arg Lys 
115 120 125 
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Thr Gin Cys Met Pro Arg Glu Val Cys lie Asp Val Gly Lys Glu Phe 
130 135 140 

Gly Val Ala Thr Asn Thr Phe Phe Lys Pro Pro Cys Val Ser Val Tyr 
145 150 155 160 

Arg Cys Gly Gly Cys Cys Asn Ser Glu Gly Leu Gin Cys Met Asn Thr 
165 170 175 

Ser Thr Ser Tyr Leu Ser Lys Thr Leu Phe Glu lie Thr Val Pro Leu 
* 180 185 190 

Ser Gin Gly Pro Lys Pro Val Thr lie Ser Phe Ala Asn His Thr Ser 
155 200 205 

Cys Arg Cys Met Ser Lys Leu Asp Val Tyr Arg Gin Val His Ser lie 
210 215 220 

He Arg Arg Ser Leu Pro Ala Thr Leu Pro Gin Cys Gin Ala Ala Asn 
225 230 235 240 

• - - 

Lys Thr Cys Pro Thr Asn Tyr Met Trp Asn Asn His He Cys Arg Cys 
245 250 255 

Leu Ala Gin Glu Asp Phe Met Phe Ser Ser Asp Ala Gly Asp Asp Ser 
260 265 270 

Thr Asp Gly Phe His Asp He Cys Gly Pro Asn Lys Glu Leu Asp Glu 
275 280 285 

Glu Thr Cys Gin Cys Val Cys Arg Ala Gly Leu Arg Pro Ala Ser Cys 
290 295 300 

Gly Pro His Lys Glu Leu Asp Arg Asn Ser Cys Gin Cys Val Cys Lys 
305 310 315 320 

Asn Lys Leu Phe Pro Ser Gin Cys Gly Ala Asn Arg Glu Phe Asp Glu 
325 330 335 

Asn Thr Cys Gin Cys Val Cys Lys Arg Thr Cys Pro Arg Asn Gin Pro 
340 345 350 

Leu Asn Pro Gly Lys Cys Ala Cys Glu Cys Thr Glu Ser Pro Gin. Lys 
355 360 365 

Cys Leu Leu Lys Gly Lys Lys Phe His His Gin Thr Cys Ser Cys Tyr 
370 375 380 

Arg Arg Pro Cys Thr Asn Arg Gin Lys Ala Cys Glu Pro Gly Phe Ser 
385 390 395 400 

Tyr Ser Glu Glu Val Cys Arg Cys Val Pro Ser Tyr Trp Lys Arg Pro 
405 410 415 

Gin Met Ser 



(2) INFORMATION FOR SEQ ID NO:34: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 34: 

TGAGTGATTTGTAGCTGCTGTG 

(2) INFORMATION FOR SEQ ID NO:35: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:35: 

TATTGCAGCAACCCCCACATCT 

(2) INFORMATION FOR SEQ ID NO:36: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4416 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:36: 
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CCACGCGCAG 


CGGCCGGAGA 


TGCAGCGGGG 


CGCCGCGCTG 


TGCCTGCGAC 


TGTGGCTCTG 


60 




CCTGGGACTC 


CTGGACGGCC 


TGGTGAGTGG 


CTACTCCATG 


ACCCCCCCGA 


CCTTGAACAT 


120 


5 


CACGGAGGAG 


TCACACGTCA 


TCGACACCGG 


TGACAGCCTG 


TCCATCTCCT 


GCAGGGGACA 


180 




GCACCCCCTC 


GAGTGGGCTT 


GGCCAGGAGC 


TCAGGAGGCG 


CCAGCCACCG 


GAGACAAGGA 


240 




CAGCGAGGAC 


ACGGGGGTGG 


TGCGAGACTG 


CGAGGGCACA 


G&CGCCAGGC 


CCTACTGCAA 


300 


10 


GGTGTTG CTG 


CTGCACGAGG 


TACATGCCAA 


CGACACAGGC 


AGCTACGTCT 


GCTACTACAA 


360 




GTACATCAAG 


GCACGCATCG 


AGGGCACCAC 


GGCCGCCAGC 


TCCTACGTGT 


TCGTGAGAGA 


420 




CTTTGAGCAG 


CCATTCATCA 


ACAAGCCTGA 


CACGCTCTT6 


GTCAACAGGA 


AGGACGCCAT 


480 


15 


GTGGGTGCCC 


TGTCTGGTGT 


CCATCCCCGG 


CCTCAATGTC 


ACGCTGCGCT 


CGCAAAGCTC 


540 




GGTGCTGTGG 


CCAGACGGGC 


AGGAGGTGGT 


GTGGGATGAC 


CGGCGGGGCA 


TGCTCGTGTC 


600 




CACGCCACTG 


CTGCACGATG 


CCCTGTACCT 


GCAGTGCGAG 


ACCACCTGGG 


GAGACCAGGA 


660 


20 


CTTCCTTTCC 


AACCCCTTCC 


TGGTGCACAT 


CACAGGCAAC 


GAGCTCTATG 


ACATCCAGCT 


720 




GTTGCCCAGG 


AAGTCGCTGG 


AGCTGCTGGT 


AGGGGAGAAG 


CTGGTCCTGA 


ACTGCACCGT 


780 




GTGGGCTGAG 


TTTAACTCAG 


GTGTCACCTT 


TGACTGGGAC 


TACCCAGGGA 


AGCAGGCAGA 


840 


25 


GCGGGGTAAG 


TGGGTGCCCG 


AGCGACGCTC 


CCAGCAGACC 


CACACAGAAC 


TCTCCAGCAT 


. 900 




CCTGACCATC 


CACAACGTCA 


GCCAGCACGA 


CCTGGGCTCG 


TATGTGTGCA 


AGGCCAACAA 


960 




CGGCATCCAG 


CGATTTCGGG 


AGAGCACCGA 


GGTCATTGTG 


CATGAAAATC 


CCTTCATCAG 


1020 


30 


CGTCGAGTCG 


CTCAAAGGAC 


CCATCCTGGA 


GGCCACGGCA 


GGAGACGAGC 


TGGTGAAGCT 


1080 




GCCCG TGAAG 


CTGGCAGCGT 


ACCCCCCGCC 


CGAGTTCCAG 


TGGTACAAGG 


ATGGAAAGGC 


1140 




ACTGTCCGGG 


CGCCACAGTC 


CACATGCCCT 


GGTGCTCAAG 


GAGGTGACAG 


AGGCCAGCAC 


1200 


35 


AGGCACCTAC 


ACCCTCGCCC 


TGTGGAACTC 


CGCTGCTGGC 


CTGAGGCGCA 


ACATCAGCCT 


1260 




GGAGCTGGTG 


GTGAATGTGC 


CCCCCCAGAT 


ACATGAGAAG 


GAGGCCTCCT 


CCCCCAGCAT 


1320 



40 



45 



50 



55 
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10 



15 



20 



25 



30 



35 



40 



45 



50 



55 



CTACTCGCGT CACAGCCGCC AGGCCCTCAC CTGCACGGCC TACGGGGTGC CCCTGCCTCT 1380 

CAGCATCCAG TGGCACTGGC GGCCCTGGAC ACCCTGCAAG ATGTTTGCCC AGCGTAGTCT 1440 

CCGGCGGCGG CAGCAGCAAG ACCTCATGCC ACAGTGCCGT GACTGGAGGG CGGTGACCAC 1500 

GCAGGATGCC GTGAACCCCA TCGAGAGCCT GGACACCTGG ACCGAGTTTG TGGAGGGAAA 1560 

GAATAAGACT GTGAGCAAGC TGGTGATCCA GAATGCCAAC GTGTCTGCCA TGTACAAGTG 1620 

TGTGGTCTCC 4 AACAAGGTGG GCCAGGATGA GCGGCTCATC TACTTCTATG TGACCACCAT 1680 

CCCCGACGGC TTCACCATCG AATCCAAGCC ATCCGAGGAG CTACTAGAGG GCCAGCCGGT 1740 

GCTCCTGAGC TGCCAAGCCG ACAGCTACAA GTACGAGCAT CTGCGCTGGT ACCGCCTCAA 1800 

CCTGTCCACG CTGCACGATG CGCACGGGAA CC0GCTTCT 6 CTCGACTGCA AGAACGTGCA 1860 

TCTGTTCGCC ACCCCTCTGG CCGCCAGCCT GGAGGAGGTG GCACCTGGGG CGCGCCACGC 1920 

CACGCTCAGC CTGAGTATCC CCCGCGTCGC GCCCGAGCAC GAGGGCCACT ATGTGTGCGA 1980 

AGTGCAAGAC CGGCGCAGCC ATGACAAGCA CTGCCACAAG AAGTACCTGT CGGTGCAGGC 2040 

CCTGGAAGCC CCTCGGCTCA CGCAGAACTT GACCGACCTC CTGGTGAACG TGAGCGACTC 2100 

GCTGGAGATG CAGTGCTTGG TGGCCGGAGC GCACGCGCCC AGCATCGTGT GGTACAAAGA 2160 

CGAGAGGCTG CTGGAGGAAA AGTCTGGAGT CGACTTGGCG GACTCCAACC AGAAGCTGAG 2220 

CATCCAGCGC GTGCGCGAGG AGGATGCGGG ACGCTATCTG TGCAGCGTGT GCAACGCCAA 2280 

GGGCTGCGTC AACTCCTCCG CCAGCGTGGC CGTGGAAGGC TCCGAGGATA AGGGCAGCAT 2340 

GGAGATCGTG ATCCTTGTCG GTACCGGCGT CATCGCTGTC TTCTTCTGGG TCCTCCTCCT 2400 

CCTCATCTTC TGTAACATGA GGAGGCCGGC CCACGCAGAC ATCAAGACGG GCTACCTGTC 2460 

CATCATCATG GACCCCGGGG AGGTGCCTCT GGAGGAGCAA TGCGAATACC TGTCCTACGA 2520 

TGCCAGCCAG TGGGAATTCC CCCGAGAGCG GCTGCACCTG GGGAGAGTGC TCGGCTACGG 2580 

CGCCTTCGGG AAGGTGGTGG AAGCCTCCGC TTTCGGCATC CACAAGGGCA GCAGCTGTGA 2640 

CACCGTGGCC GTGAAAATGC TGAAAGAGGG CGCCACGGCC AGCGAGCACC GCGCGCTGAT 2700 

GTCGGAGCTC AAGATCCTCA TTCACATCGG CAACCACCTC AACGTGGTCA ACCTCCTCGG 2760 

GGCGTGCACC AAGCCGCAGG GCCCCCTCAT GGTGATCGTG GAGTTCTGCA AGTACGGCAA 2820 

CCTCTCCAAC TTCCTGCGCG CCAAGCGGGA CGCCTTCAGC CCCTGCGCGG AGAAGTCTCC 2880 

CGAGCAGCGC GGACGCTTCC GCGCCATGGT GGAGCTCGCC AGGCTGGATC GGAGGCGGCC 2940 

GGGGAGCAGC GACAGGGTCC TCTTCGCGCG GTTCTCGAAG ACCGAGGGCG GAGCGAGGCG 3000 

GGCTTCTCCA GACCAAGAAG CTGAGGACCT GTGGCTGAGC CCGCTGACCA TGGAAGATCT 3060 

TGTCTGCTAC AGCTTCCAGG TGGCCAGAGG GATGGAGTTC CTGGCTTCCC GAAAGTGCAT 3120 

CCACAGAGAC CTGGCTGCTC GGAACATTCT GCTGTCGGAA AGCGACGTGG TGAAGATCTG 3180 

TGACTTTGGC CTT6CCCGGG ACATCTACAA AGACCCTGAC TACGTCCGCA AGGGCAGTGC 3240 

CCGGCTGCCC CTGAAGTGGA TGGCCCCTGA AAGCATCTTC GACAAGGTGT ACACCACGCA 3300 

GAGTGACGTG TGGTCCTTTG GGGTGCTTCT CTGGGAGATC TTCTCTCTGG GGGCCTCCCC 3360 

GTAC CCTGGG GTGCAGATCA ATGAGGAGTT CTGCCAGCGG CTGAGAGACG GCACAAGGAT 3420 
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GAGGGCCCCG 


GAGCTGGCCA 


CTCCCGCCAT 


ACGCCGCATC ATGCTGAACT GCTGGTCCGG 


3480 


AGACCCCAAG 


OCGAGACCTG 


CATTCTCGGA 


GCTGGTGGAG ATCCTGGGGG ACCTGCTCCA 


3540 


GGGCAGGGGC 


CTGCAAGAGG 


AAGAGGAGGT 


CTGCATGGCC CCGCGCAGCT CTCAGAGCTC 


3600 


AGAAGAGGGC 


AG CTTCTCGC 


AGGTGTCCAC 


CATGGCCCTA CACATCGCCC AGGCTGACGC 


3660 


TGAGGACAGC 


CCGCCAAGCC 


TGCAGCGCCA 


CAGCCTGGCC GCCAGGTATT ACAACTGGGT 


3720 


GTCCTTTCCC 


'GGGTGCCTGG 


CCAGAGGGGC 


TGAGACCCGT GGTTCCTCCA GGATGAAGAC 


3780 


ATTTGAGGAA 


TTCCCCATGA 


CCCCAACGAC 


CTACAAAGGC TCTGTGGACA ACCAGACAGA 


3840 


CAGTGGGATG 


GTGCTGGCCT 


CGGAGGAGTT 


TGAGCAGATA GAGAGCAGGC ATAGACAAGA 


3900 


AAGCGGCTTC 


AGGTAGCTGA 


AGCAGAGAGA 


GAGAAGGCAG CATACGTCAG CATTTTCTTC 


3960 


TCTGCACTTA 


TAAGAAAGAT 


CAAAGACTTT 


AAGACTTTCG CTATTTCTTC TACTGCTATC 


4020 


TACTACAAAC 


TTCAAAGAGG 


AACCAGGAGG 


ACAAGAGGAG CATGAAAGTG GACAAGGAGT 


4080 


GTGACCACTG 


AAGCACCACA 


GGGAAGGGGT 


TAGGCCTCCG GATGACTGCG GGCAGGCCTG 


4X40 


GATAATATCC 


AGCCTCCCAC 


AAGAAGCTGG 


TGGAGCAGAG TGTTCCCTGA CTCCTCCAAG 


4200 


GAAAGGGAGA 


CGCCCTTTCA 


TGGTCTGCTG 


AGTAACAGGT GCNTTCCCAG ACACTGGCGT 


4260 


TACTGCTTGA 


CGAAAGAGCC 


CTCAAGCGGC 


CCTTATGCCA GCGTGACAGA GGGCTCACCT 


4320 


CTTGCCTTCT 


AGGTCACTTC 


TCACACAATG 


TCCCTTCAGC ACCTGACCCT GTGCCCGCCA 


4380 


GTTATTCCTT 


GGTAATATGA 


GTAATACATC 


AAAGAG 


4416 


(2) INFORMATION FOR SEQ ID NO: 37; 







(i) SEQUENCE CHARACTERISTICS: 

35 (A) LENGTH: 4273 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

40 (jj) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:37: 
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>vrtV->V, X XaXI»VT 


*%X X X wVJtnrilwW 










0 1/ 










X XWX X*—#Mi»\- 


X v>w X iWi 




1 <9A 


5 






& C2^W2 ^* & A. & T*f 








i fin 






Vi ft. Xuwv>vUviV> 








V. X X iuui\,Uw 








V3HAV9\3nA^ X 


1*11 f T*I*f 






\» invfui 


O v w 


10 


"Tv f J* T*^T^^^* ^\ ~ 

Aui/*X X Jl#UwiVa 


CXCTAAOOTA 


AHiAlAnnnJl 


XXX i>UWJ X\9 X 


#»X#W%XVrX\9X X 


wtt. X AW iwn 






TTCXAAxTGT 


X XblulAX X X 


XAwil X UL/vi 


w k'"B "fYTin & *B f 
v*V* X AluwviU 


X WilwiHiw 


KtW&*—JiAa Xi>V» X 


AO A 




GGAATGCCTT 


TAATGAGGAA 


AACCTGTTTT 


6CTCA6AAGA 


AATGCCATCT 


ASTGATGATG 


480 


15 


AGGCTACTGC 


TGACTCTCAA 


CATTCTACTC 


CTCCAAAAAA 


GAAGAGAAAG 


GTAGAAGACC 


540 




CCAAGGACTT 


TCCTTCAGAA 


TTGCTAAGTT 


TTTTGAGTCA 


TGCTGTGTTT 


AGTAATAGAA 


600 




CTCTTGCTTG 


CTTTGCTATT 


TACACCACAA 


AGGAAAAAGC 


T6CACT6CTA 


TACAAGAAAA 


660 


20 


TTATGGAAAA ATATTCTGTA 


ACCTTTATAA 


GTAGGCATAA 


CAGTTATAAT 


CATAACATAC 


720 



25 



30 



35 
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TGTTTTTTCT TACTCCACAC AGGCATAGAG TGTCTGCTAT TAATAACTAT GCTCAAAAAT 780 

TGTGTACCTT TAGCTTTTTA ATTTGTAAAG GGGTTAATAA GGAATATTTG ATGTATAGTG 840 

CCTTGACTAG AGATCATAAT CAGCCATACC ACATTTGTAG AGGTTTTACT TGCTTTAAAA 900 

AACCTCCCAC ACCTCCCCCT GAACCTGAAA CATAAAATGA ATGCAATTGT TGTTGTTAAC BSD 

TTGTTTATTG CAGCTTATAA TGGTTACAAA TAAAGCAATA GCATCACAAA TTTCACAAAT 1020 

AAAGCATTTT TTTCACTGCA TTCTAGTTGT GGTTTGTCCA AACTCATCAA TGTATCTTAT 10 BO 

CATGTCTGGA TCTGCCGGTC TCCCTATAGT GAGTCGTATT AATTTCGATA AGCCAGGTTA 1140 

ACCTGCATTA ATGAATCGGC CAACGCGCGG GGAGAGGCGG TTTGCGTATT GGGCGCTCTT 1200 

CCGCTTCCTC GCTCACTGAC TCGCTGCGCT CGGTCGTTCG GCTGCGGCGA GCGGTATCAG 1260 

CTCACTCAAA GGCGGTAATA CGGTTATCCA CAGAATCAGG GGATAACGCA GGAAAGAACA 1320 

TGTGAGCAAA AGGCCAGCAA AAGGCCAGGA ACCGTAAAAA GGACGCGTTG CTGGCGTTTT 1380 

TCCATAGGCT CCGCCCCCCT GACGAGCATC ACAAAAATCG ACGCTCAAGT CAGAGGTGGC 1440 

GAAACCCGAC AGGACTATAA AGATACCAGG CGTTTCCCCC TGGAAGCTCC CTCGTGCGCT 1500 

CTCCTGTTCC GACCCTGCCG CTTACCGGAT ACCTGTCCGC CTTTCTCCCT TCGGGAAGCG 1560 

TGGCGCTTTC TCAATGCTCA CGCTGTAGGT ATCTCAGTTC GGTGTAGGTC GTTCGCTCCA 1620 

AGCTGGGCTG TGTGCACGAA CCCCCCGTTC AGCCCGACCG CTGCGCCTTA TCCGGTAACT 1680 

ATCGTCTTGA GTCCAACCCG GTAAGACACG ACTTATCGCC ACTGGCAGCA GCCACTGGTA 1740 

ACAGGATTAG CAGAGCGAGG TATGTAGGCG GTGCTACAGA GTTCTTGAAG TGGTGGCCTA 1800 

ACTACGGCTA CACTAGAAGG ACAGTATTTG GTATCTGCGC TCTGCTGAAG CCAGTTACCT 1860 

TCGGAAAAAG AGTTGGTAGC TCTTGATCCG GCAAACAAAC CACCGCTGGT AGCGGTGGTT 1920 

TT T TTGTTTG CAAGCAGCAG ATTACGCGCA GAAAAAAAGG ATCTCAAGAA GATCCTTTGA 1S80 

TCTTTTCTAC GGGGTCTGAC GCTCAGTGGA ACGAAAACTC ACGTTAAGGG ATTTTGGTCA 2040 

TGAGATTATC AAAAAGGATC TTCACCTAGA TC CTTTT AAA TTAAAAATGA AGTTTTAAAT 2100 

CAATCTAAAG TATATATGAG TAAACTTGGT CTGACAGTTA CCAATGCTTA ATCAGTGAGG 2160 

CACCTATCTC AGCGATCTGT CTATTTCGTT CATCCATAGT TGCCTGACTC CCCGTCGTGT 2220 

AGATAACTAC GATACGGGAG GGCTTACCAT CTGGCCCCAG TGCTGCAATG ATACCGCGAG 2280 

ACCCACGCTC ACCGGCTCCA GATTTATCAG CAATAAACCA GCCAGCCGGA AGGGC CGAGC 2340 

GCAGAAGTGG TCCTGCAACT TTATCCGCCT CCATCCAGTC TATTAATTGT TGCCGGGAAG 2400 

CTAGAGTAAG TAGTTCGCCA GTTAATAGTT TGCGCAACGT TGTTGCCATT GCTACAGGCA 2460 

TCGTGGTGTC ACGCTCGTCG TTTGGTATGG CTTCATTCAG CTCCGGTTCC CAACGATCAA 2520 

GGCGAGTTAC ATGATCCCCC ATGTTGTGCA AAAAAGCGGT TAGCTCCTTC GGTCCTCCGA 2580 

TCGTTGTCAG AAGTAAGTTG GCCGCAGTGT TATCACTCAT GGTTATGGCA GCACTGCATA 2640 

ATTCTCTTAC TGTCATGCCA TCCGTAAGAT GCTTTTCTGT GACTGGTGAG TACTCAACCA 2700 

AGTCATTCTG AGAATAGTGT ATGCGGCGAC CGAGTTGCTC TTGCCOGGCG TCAATACGGG 2760 

ATAATACCGC GCCACATAGC AGAACTTTAA AAGTGCTCAT CATTGGAAAA CGTTCTTCGG 2820 
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GGCGAAAACT 


CTCAAGGATC 


TTACCGCTGT 


TGAGATCCAG 


TTCGATGTAA 


CCCACTCGTG 


2880 




CACCCAACTG 


ATCTTCAGCA 


TCTTTTACTT 


TCACCAGCGT 


TTCTGGGTGA 


GCAAAAACAG 


2940 


5 


GAAGGCAAAA 


TGCCGCAAAA 


AAGGGAATAA 


GGGCGACACG 


GAAATGTTGA ATACTCATAC 


3000 




TCTTCCTTTT 


TCAATATTAT 


TGAAGCATTT 


ATCAGGGTTA 


TTGTCTCATG 


AGCGGATACA 


3060 




TATTTGAATG 


TATTTAGAAA 


AATAAACAAA 


TAGGGGTTCC 


GCGCACATTT 


CCCCGAAAAG 


3X20 


10 


tgccacctga' cgtctaagaa 


ACCATTATTA 


TCATGACATT AACCTATAAA AATAGGCGTA 


3180 




TCACGAGGCC 


CTTTCGTCTC 


GCGCGTTTCG 


GTGATGACGG TGAAAACCTC 


TGACACATGC 


3240 




AGCTCCCGGA 


GACGGTCACA 


GCTTGTCTGT 


AAGCGGATGC 


CGGGAGCAGA 


CAAGCCCGTC 


3300 


15 


AGGGCGOGTC 


AGCGGGTGTT 


GGCGGGTGTC 


GGGGCTGGCT 


TAACTATGCG 


GCATCAGAGC 


3360 




AGATTGTACT 


GAGAGTGCAC 


CATATGGACA 


TATTGTCGTT AGAACGCGGC 


TACAATTAAT 


3420 




ACATAACCTT ATCTATCATA 


CACATACGAT TTAGGTGACA CTATAGAACT CGAGCAGAGC 


3480 


20 


TTCCAAATTG 


AGAGAGAGGC 


TTAATCAGAG 


ACAGAAACTG 


TTTGAGTCAA 


CTCAAGGATG 


3540 




GTTTGAGGGA 


CTGTTTAACA 


GATCCCCTTG 


GTTTACCACC 


TTGATATCTA 


CCATTATGGG 


3600 




ACCCCTCATT 


GTAC7CCTAA 


TGATTTTGCT 


CTTCGGACCC 


TGCATTCTTA ATCGATTAGT 


3660 


25 


CCAATTTGTT 


AAAGACAGGA 


TATCAGTGGT 


CCAGGCTCTA 


GTTTTGACTC 


AACAATATCA 


3720 




CCAGCTGAAG 


CCTATAGAGT 


ACGAGCCATA 


GATAAAATAA 


AAGATTTTAT 


TTAGTCTCCA 


3780 




GAAAAAGGGG 


GGAATGAAAG 


ACCCCACCTG 


TAGGTTTGGC 


AAGCTAGCTT 


AAGTAACGCC 


3840 


30 


ATTTTGCAAG 


GCATGGAAAA 


ATACATAACT 


GAGAATAGAG 


AAGTTCAGAT 


CAAGGTCAGG 


3900 




AACAGATGGA 


ACAGCTGAAT 


ATGGGCCAAA 


CAGGATATCT 


GTGGTAAGCA 


GTTCCTGCCC 


3960 




CGGCTCAGGG 


CCAAGAACAG 


ATGGAAGAGC 


7GAATATGGG 


CCAAACAGGA 


TATCTGTGGT 


4020 


35 


AAGCAGTTCC 


TGCCCCGGCT 


CAGGGCCAAG 


AACAGATGGT 


CCCCAGATGC 


GGTCCAGCCC 


4080 




TCAGCAGTTT 


CTAGAGAACC 


ATCAGATGTT 


TCCAGGGTGC 


CCCAAGGACC 


TGAAATGACC 


4140 




CTGTGCCTTA 


TTTGAACTAA 


CCAATCAGTT 


CGCTTCTCGC 


TTCTGTTCGC 


GCGCTTCTGC 


4200 


40 


TCCCCGAGCT 


CAATAAAAGA 


GCCCACAACC 


CCTCACTCGG 


GGCGCCAGTC 


CTCCGATTGA 


4260 




CTGAGTCGCC 


CGG 










4273 



(2) INFORMATION FOR SEQ ID NO:38: 

45 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 216 base pairs 

(B) TYPE: nucleic acid 

50 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECEILE TYPE: cDNA 

55 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:38: 
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CAAGAAAGCG GCTTCAGCTG TAAAGGACCT GGCCAGAATG TGGCTGTGAC CAGGGCACAC 



60 



CCTGACTCCC AAGGGAGGCG GCGGCGGCCT GAGCGGGGGG CCCGAGGAGG CCAGGTGTTT 



120 



TACAACAGCG AGTATGGGGA GCTGTCGGAG CCAAGCGAGG AGGACCACTG CTCCCCGTCT 



iao 



GCCCGCGTGA CTTTCTTCAC AGACAACAGC TACTAA 



216 



(2) INFORMATION FOR SEQ ID NO:39: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 17 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:39: 



Qlu Glu Thr lie Lys Phe Ala Ala Ala His Tyr Asn Thr Glu He Leu 
15 10 15 



(2) INFORMATION FOR SEQ ID NO:40: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1836 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 
(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 168.. 1412 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:40: 
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GCGGCCGCGT CGACGCAAAA GTTGCGAGCC GCCGAGTCCC GGGAGACGCT CGCCCAGGGG 60 

GGTCCCCGGG AGGAAACCAC GGGACAGGGA CCAGGAGAGG ACCTCAGCCT CACGCCCCAG 120 

5 CCTGCGCCAG CCAACGGACC GGCCTCCCTG CTCCCGGTCC ATCCACC ATG CAC TTG 176 

Met His Leu 
1 

CTG TGC TTC TTG TCT CTG GCG TGT TCC CTG CTC GCC GCT GCG CTG ATC 224 
Leu Cys Phe Leu Ser Leu Ala Cys Ser Leu Leu Ala Ala Ala Leu lie 

10 5 10 15 

CCC AGT CCG CGC GAG GCG CCC GCC ACC GTC GCC GCC TTC GAG TCG GGA 272 
Pro Ser Pro Arg Glu Ala Pro Ala Thr Val Ala Ala Phe Glu Ser Gly 
20 25 30 35 

15 CTG GGC TTC TCG GAA GCG GAG CCC GAC GGG GGC GAG GTC AAG GCT TTT 320 

Leu Gly Phe Ser Glu Ala Glu Pro Asp Gly Gly Glu Val Lys Ala Phe 
40 45 50 

GAA GGC AAA GAC CTG GAG GAG CAG TTG CGG TCT CTG TCC AGC GTA GAT 366 
Glu Gly Lys Asp Leu Glu Glu Gin Leu Arg Ser Val Ser ser Val Asp 
20 55 60 65 

GAG CTG ATG TCT GTC CTG TAC CCA GAC TAC TGG AAA ATG TAC AAG TGC 416 
Glu Leu Met Ser val Leu Tyr Pro Asp Tyr Trp Lys Met Tyr Lys Cys 
70 75 80 

CAG CTG CGG AAA GGC GGC TGG CAG CAG CCC ACC CTC AAT ACC AGG ACA 464 
Gin Leu Arg Lys Gly Gly Trp Gin Gin Pro Thr Leu Asn Thr Arg Thr 
85 90 95 

GGG GAC AGT GTA AAA TTT GCT GCT GCA CAT TAT AAC ACA GAG ATC CTG 512 

Gly Asp Ser Val Lys Phe Ala Ala Ala His Tyr Asn Thr Glu lie Leu 
100 105 no 115 

30 



35 



40 



45 



50 
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AAA AGT ATT GAT AAT GAG TGG AGA AAG ACT CAA TGC ATG CCA CGT GAG 560 
Lys ser lie Asp Asn Glu Trp Arg Lys Thr Gin Cys Met Pro Arg Glu 
120 125 130 

5 GTG TGT ATA GAT GTG GGG AAG GAG TTT GGA GCA GCC ACA AAC ACC TTC 608 

Val Cys He Asp Val Gly Lys Glu Phe Gly Ala Ala Thr Asn Thr Phe 
135 140 145 

TTT AAA CCT CCA TGT GTG TCC GTC TAC AGA TGT GGG GGT TGC TGC AAC 656 
Phe Lys Pro Pro Cys Val Ser Val Tyr Arg Cys Gly Gly Cys Cys Asn 
w 150 155 160 

AGG GAG GGG CTG GAG TGC ATG AAC ACC AGC ACA GGT TAC CTC AGC AAG 704 
Arg Glu Gly Leu Gin Cys Met Asn Thr Ser Thr Gly Tyr Leu Ser Lys 
165 170 175 

15 ACG TTG TTT GAA ATT ACA GTG CCT CTC TCA CAA GGC CCC AAA CCA GTC 752 

Thr Leu Phe Glu He Thr Val Pro Leu Ser Gin Gly Pro Lys Pro Val 
180 185 190 195 

ACA ATC AGT TTT GCC AAT CAC ACT TC,C TGC CGG TGC ATG TCT AAA . CTG 800 
Thr lie Ser Phe Ala Asn His Thr Ser Cys Arg Cys Met Ser Lys Leu 
200 205 210 

20 

GAT GTT TAC AGA CAA GTT CAT TCA ATT ATT AGA CGT TCT CTG CCA GCA 848 
Asp Val Tyr Arg Gin Val His Ser He He Arg Arg Ser Leu Pro Ala 
215 220 225 

ACA TTA CCA GAG TGT CAG GCA GCT AAC AAG ACA TGT CCA ACA AAC TAT 896 
pt . Thr Leu Pro Gin Cys Gin Ala Ala Asn Lys Thr Cys Pro Thr Asn. Tyr 

230 235 240 

GTG TGG AAT AAC TAC ATG TGC CGA TGC CTG GCT CAG CAG GAT TTT ATC 944 
Val Trp Asn Asn Tyr Met Cys Arg Cys Leu Ala Gin Gin Asp Phe He 
245 250 255 

30 TTT TAT TCA AAT GTT GAA GAT GAC TCA ACC AAT GGA TTC CAT GAT GTC 992 

Phe Tyr Ser Asn val Glu Asp Asp Ser Thr Asn Gly Phe His Asp Val 
260 265 270 275 

TGT GGA CCC AAC AAG GAG CTG GAT GAA GAC ACC TCT CAG TGT GTC TGC 1040 
Cys Gly Pro Asn Lys Glu Leu Asp Glu Asp Thr Cys Gin Cys Val Cys 
280 285 290 

35 

AAG GGG GGG CTT CGG CCA TCT AGT TGT GGA CCC CAC AAA GAA CTA GAT 1088 
Lys Gly Gly Leu Arg Pro Ser Ser Cys Gly Pro His Lys Glu Leu Asp 
295 300 305 

AGA GAC TCA TGT CAG TGT GTC TGT AAA AAC AAA CTT TTC CCT AAT TCA li36 
40 Arg Asp Ser Cys Gin Cys Val Cys Lys Asn Lys Leu Phe Pro Asn Ser 

310 315 320 

TGT GGA GCC AAC AGG GAA TTT GAT GAG AAT ACA TGT CAG TGT CTA TGT 1184 
Cys Gly Ala Asn Arg Glu Phe Asp Glu Asn Thr Cys Gin Cys Val Cys 
325 330 335 

45 AAA AGA ACG TGT CCA AGA AAT CAG CCC CTG AAT CCT GGG AAA TGT GCC 1232 

Lys Arg Thr Cys Pro Arg Asn Gin Pro Leu Asn Pro Gly Lys Cys Ala 
340 " ' 345 350 355 

TGT GAA TGT ACA GAA AAC ACA CAG AAG TGC TTC CTT AAA GGG AAG AAG 1280 
Cys Glu Cys Thr Glu Asn Thr Gin Lys Cys Phe Leu Lys Gly Lys Lys 
50 360 365 370 

TTC CAC CAT CAA ACA TGC AGT TGT TAC AGA AGA CCG TGT GCG AAT CGA 1328 
Phe His His Gin Thr Cys Ser Cys Tyr Arg Arg Pro Cys Ala Asn Arg 
375 380 385 
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CTG AAG CAT TGT GAT CCA GGA CTG TCC TTT AGT GAA GAA GTA TGC CGC 1376 
Leu Lys His Cys Asp Pro Gly Leu Ser Phe Ser Glu Glu Val Cys Arg 
390 395 400 

5 TGT GTC CCA TCG TAT TGG AAA AGG CCA CAT CTG AAC TAAGATCATA 1422 

Cys Val Pro Ser Tyr Trp Lys Arg Pro His Leu Asn 
405 410 415 

CCAGTTTTCA GTCAGTCACA GTCATTTACT CTCTTGAAGA CTGTTGGAAC AGCACTTAGC 1482 

10 ACTGTCTATG *CACAGAAAGA CTCTGTGGGA CCACATGGTA ACAGAGGCCC AAGTCTGTGT 1542 

TTATTGAACC ATGTGGATTA CTGCGGGAGA GGACTGGCAC TCATGTGCAA AAAAAACCTC 1602 

TTCAAAGACT GGTTTTCTGC CAGGGACCAG ACAGCT GAGG TTTTTCTCTT GTGATTTAAA 1662 

15 AAAAGAATGA CTATATAATT TATTTCCACT AAAAATATTG TTCCTGCATT CATTTTTATA 1722 

GCAATAACAA TTGGTAAAGC TCACTGTGAT CAGTATTTTT ATAACATGCA AAACTATGTT 1782 

TAAAATAAAA TGAAAATTGT ATTATAAAAA AAAAAAAAAA AAAAAAAAAA GCTT - 1836 

20 

(2) INFORMATION FOR SEQ ID NO:41 : 

(i) SEQUENCE CHARACTERISTICS: 

25 (A) LENGTH: 415 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

30 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:41: 
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Met His Leu Leu Cys Phe Leu Ser Leu Ala Cys Ser Leu Leu Ala Ala 
15 10 15 

Ala Leu lie Pro Ser Pro Arg Glu Ala Pro Ala Thr Val Ala Ala Phe 
20 25 30 

Glu Ser Gly Leu Gly Phe Ser Glu Ala Glu Pro Asp Gly Gly Glu Val 
35 40 45 

Lys Ala Phe Glu Gly Lys Asp Leu Glu Glu Gin Leu Arg Ser Val Ser 
50 55 60 

Ser Val Asp Glu Leu Met Ser Val Leu Tyr Pro Asp Tyr Trp Lys Met 
65 70 75 80 

Tyr Lys Cys Gin Leu Arg Lys Gly Gly Trp Gin Gin Pro Thr Leu Asn 

85 90 95 

Thr Arg Thr Gly Asp Ser Val Lys Phe Ala Ala Ala His Tyr Asn Thr 
100 105 110 

Glu lie Leu Lys Ser lie Asp Asn Glu Trp Arg Lys Thr Gin Cys Met 
115 120 125 

Pro Arg Glu Val Cys He Asp Val Gly Lys Glu Phe Gly Ala Ala Thr 
130 135 140 

Asn Thr Phe Phe Lys Pro Pro Cys Val Ser Val Tyr Arg Cys Gly Gly 
145 150 155 160 

Cys Cys Asn Arg Glu Gly Leu Gin Cys Met Asn Thr Ser Thr Gly Tyr 
165 170 175 

Leu Ser Lys Thr Leu Phe Glu He Thr Val Pro Leu Ser Gin Gly Pro 
180 185 190 
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Lys Pro Val Thr lie Ser Phe Ala Asn His Thr Ser Cys Arg Cys Met 
19S 200 205 

Ser Lys Leu Asp Val Tyr Arg Gin Val His Ser lie He Arg Arg Ser 
210 215 220 

Leu Pro Ala Thr Leu Pro Gin Cys Gin Ala Ala Asn Lys Thr Cys Pro 
225 230 235 240 

Thr Asn Tyr Val Trp Asn Asn Tyr Met Cys Arg Cys Leu Ala Gin Gin 
245 250 255 

Asp Phe He Phe Tyr Ser Asn Val Glu Asp Asp Ser Thr Asn Gly Phe 
260 265 270 

His Asp Val Cys Gly Pro Asn Lys Glu Leu Asp Glu Asp Thr Cys Gin 
275 280 285 

Cys Val Cys Lys Gly Gly Leu Arg Pro Ser Ser Cys Gly Pro His Lys 
290 295 300 

Glu Leu Asp Arg Asp Ser cys Gin Cys Val Cys Lys Asn Lys Leu Phe 
305 310 315 320 

Pro Asn Ser Cys Gly Ala Asn Arg Glu Phe Asp Glu Asn Thr Cys Gin 
325 330 335 

Cys Val Cys Lys Arg Thr Cys Pro Arg Asn Gin Pro Leu Asn Pro Gly 
340 345 350 

Lys Cys Ala Cys Glu Cys Thr Glu Asn Thr Gin Lys Cys Phe Leu Lys 
355 360 365 

Gly Lys Lys Phe His His Gin Thr Cys Ser Cys Tyr Arg Arg Pro Cys 
370 375 380 

Ala Asn Arg Leu Lys His Cys Asp Pro Gly Leu Ser Phe Ser Glu Glu 
385 390 395 400 

Val Cys Arg Cys Val Pro Ser Tyr Trp Lys Arg Pro His Leu Asn 
405 410 415 

(2) INFORMATION FOR SEQ ID NO:42: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:42: 



(2) INFORMATION FOR SEQ ID NO: 43: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 
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(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

5 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:43. 



TCTCTTCTGT CCCTGAGTTG AO 22 

10 

(2) INFORMATION FOR SEQ ID NO:44: 

(i) SEQUENCE CHARACTERISTICS: 

15 

(A) LENGTH: 65 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

20 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:44: 

25 

TGTGCTGCAG CAAATTTTAT AGTCTCTTCT GTGGCGGCGG CGGCGGCGGG CGCCTCGCGA 60 
GGACC 65 

(2) INFORMATION FOR SEQ ID NO:45: 

30 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 30 base pairs 

(B) TYPE: nucleic acid 

35 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

40 (xi) SEQUENCE DESCRIPTION: SEQ ID NO:45: 



CTGGCAGGGA ACTGCTAATA ATGGAATGAA 30 

45 (2) INFORMATION FOR SEQ ID NO:46: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 84 base pairs 
50 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

55 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO.-46: 
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GGGCTCCGCG TCCGAGAGGT CGAGTCCGGA CTCGTGATGG TGATGGTGAT GGGCGGCGGC 
GGCGGCGGGC GCCTCGCGAG GACC 

(2) INFORMATION FOR SEQ ID NO: 47: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 31 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:47: 

GTATTATAAT GTCCTCCACC AAATTTTATA G 

(2) INFORMATION FOR SEQ ID NO:48: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 93 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:48: 

GTTCGCTGCC TGACACTGTG GTAGTGTTGC TGGCGGCCGC TAGTGATGGT GATGGTGATG 
AATAATGGAA TGAACTTGTC TGTAAACATC CAG 

(2) INFORMATION FOR SEQ ID NO:49: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:49: 

CATGTACGAA CCGCCAGG 

(2) INFORMATION FOR SEQ ID NO:50: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

5 

(ii) MOLECULE TYPE: cDNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:50: 

10 

AATGACCAGA GAGAGGCGAG 20 

(2) INFORMATION FOR SEQ ID NO:51 : 
15 (i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 10 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 
20 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 51 : 

25 

Ala Val Val Met Thr Gin Thr Pro Ala Ser 
15 10 

30 (2) INFORMATION FOR SEQ In NO:52: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1741 base pairs 
35 (B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

40 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 453.. 1706 

45 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:52: 



GCCCCCGCCG AGCGCTCCGC GCGCAGCCGC CGGGCCGGGC CGGCCCGCGG AGGGCGCGCT 60 

50 

GCGAGCGGCC ACTGGGTCCT GCTTCCCTCC TTCCTCTCCC TCCTCCTCCT CCTCCTTCTC 120 

TCTGCGCTTT CCACCGCTCC CGAGCGAGCG CACGCTCGGA TGTCCGGTTT CCTGGTGGGT 180 
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5 



15 



20 



35 



45 



50 



TTTTTACCTG GCAAAGTCCG GATAACTTCG GTGAGAATTT 


GCAAAGAGGC TGGGAGCTCC 


240 


CCTGCAGGCG TCTGGGAGCT GCTGCCGCCG TCGCATCTTC 


TCCATCCCGC GGATTTTACT 


o f\ rx 
300 


GCCTTGGATA TTGCGAGGGG AGGGAGGGGG GTGAGGACAG 


CAAAAAGAAA GGGGTGGGGG 


360 


GGGGGAGAGA AAAGGAAAAG AAGGAGCCTC GGAATTGTGC 


CCGCATTCCT GCGCTGCCCC 


420 


GCGGCCCCCC TCCGCTCTGC CATCTCCGCA CA 

i 


ATG 
Met 
1 


CAC 
His 


TTG 
Leu 


CTG 
Leu 


GAG 
Glu 
5 


ATG 
Met 


CTC 
Leu 


473 


TCC 
Ser 


CTG 
Leu 


GGC 
Gly 
10 


TGC 
Cys 


TGC 
Cys 


CTC 
Leu 


GCT 
Ala 


GCT 
Ala 
15 


GGC 
Gly 


GCC 
Ala 


GTG 
Val 


CTC 
Leu 


CTG 
Leu 
20 


GGA 
Gly 


CCC 
Pro 


GGG 
Arg 


521 


CAG 
Gin 


CCG 
Pro 
25 


CCC 
Pro 


GTC 

val 


GCC 
Ala 


GCC 
Ala 


GCC 
Ala 
30 


TAC 
Tyr 


GAG 
Glu 


TCC 
Ser 


GGG 
Gly 


CAC 
His 
35 


GGC 
Gly 


TAC 
Tyr 


TAC 
Tyr 


GAG 
Glu 


569 


GAG 
Glu 
40 


GAG 
Glu 


CCC 
Pro 


GGT 
Gly 


GCC 
Ala 


GGG 
Gly 
45 


GAA 
Glu 


CCC 
Pro 


AAG 
Lys 


GCT 
Ala 


CAT 
His 
50 


GCA 
Ala 


AGC 
Ser 


AAA 

Lys 


GAC 
Asp 


CTG 
Leu 
55 


617 


GAA 
Glu 


GAG 
Glu 


CAG 
Gin 


TTG 
Leu 


CGA 
Arg 
€0 


TCT 

ser 


GTG 
val 


TCC 
Ser 


AGT 
Ser 


GTG 
Val 
65 


GAT 
Asp 


GAA 
Glu 


CTC 
Leu 


ATG 
Met 


ACA 
Thr 
70 


GTA 
val 


665 


CTT 
Leu 


TAC 
Tyr 


CCA 
Pro 


GAA 
Glu 
75 


TAC 
Tyr 


TGG 
Trp 


AAA 

Lys 


ATG 
Met 


TTC 
Phe 
80 


AAA 

Lys 


TGT 
Cys 


CAG 
Gin 


TTG 
Leu 


AGG 
Arg 
85 


AAA 

Lys 


GGA 
Gly 


713 


GGT 
Gly 


TGG 
Trp 


CAA 
Gin 
90 


CAC 
His 


AAC 
Asn 


AGG 
Arg 


GAA 
Glu 


CAC 
His 
95 


TCC 
Ser 


AGC 
Ser 


TCT 
Ser 


GAT 
Asp 


ACA 
Thr 
100 


AGA 
Arg 


TCA 

Ser 


GAT 
Asp 


761 


GAT 
Asp 


TCA 
Ser 
105 


TTG 
Leu 


AAA 
Lys 


TTT 
Phe 


GCC 
Ala 


GCA 
Ala 
110 


GCA 
Ala 


CAT 
His 


TAT 
Tyr 


AAT 
Asn 


GCA 
Ala 
115 


GAG 
Glu 


ATC 
He 


CTG 
Leu 


AAA 

Lys 


809 


AGT 
Ser 
120 


ATT 
He 


GAT 
Asp 


ACT 
Thr 


GAA 
Glu 


TGG 
Trp 
125 


AGA 
Arg 


AAA 

Lys 


ACC 
Thr 


CAG 
Gin 


GGC 
Gly 
130 


ATG 
Met 


CCA 
Pro 


CGT 
Arg 


GAA 
Glu 


GTG 
Val 
135 


857 


TGT 
Cys 


GTG 
Val 


GAT 
Asp 


TTG 
Leu 


GGG 
Gly 

140 


AAA 
Lys 


GAG 
Glu 


TTT 
Phe 


GGA 
Gly 


GCA 
Ala 
145 


ACT 
Thr 


ACA 
Thr 


AAC 

Asn 


ACC 
Thr 


TTC 
Phe 
150 


TTT 

Phe 


905 


AAA 
Lys 


CCC 
Pro 


CCG 
Pro 


TGT 
Cys 
155 


GTA 
Val 


TCC 
Ser 


ATC 
He 


TAC 
Tyr 


AGA 
Arg 
160 


TGT 
Cys 


GGA 
Gly 


GGT 
Gly 


TGC 
Cys 


TGC 
Cys 
165 


AAT 
Asn 


AGT 
Ser 


953 


GAA 
Glu 


GGA 
Gly 


CTC 
Leu 
170 


CAG 

Gin 


TGT 

Cys 


ATG 
Met 


AAT 
Asn 


ATC 
He 
175 


AGC 
Ser 


ACA 
Thr 


AAT 
Asn 


TAC 
Tyr 


ATC 
He 
180 


AGC 
Ser 


AAG 
Lys 


ACA 
Thr 


1001 


TTG 
Leu 


TTT 
Phe 
185 


GAG 
Glu 


ATT 
He 


ACA 
Thr 


GTG 
Val 


CCT 
Pro 
190 


CTC 
Leu 


TCT 
Ser 


CAT 
His 


GGC 
Gly 


CCC 
Pro 
195 


AAA 

Lys 


CCT 
Pro 


GTA 
val 


ACA 
Thr 


1049 


GTC 
val 
200 


AGT 
Ser 


TTT 
Phe 


GCC 
Ala 


AAT 
Asn 


CAC 
His 
205 


ACG 
Thr 


TCC 
Ser 


TGC 
Cys 


CGA 
Arg 


TGC 
Cys 
210 


ATG 
Met 


TCT 
Ser 


AAG 
Lys 


TTG 
Leu 


GAT 
Asp 
215 


1097 


GTT 
Val 


TAC 
Tyr 


AGA 
Arg 


CAA 
Gin 


GTT 
Val 


CAT 
Bis 


TCT 
Ser 


ATC 
He 


ATA 
He 


AGA 
Arg 


CGT 
Arg 


TCC 
Ser 


TTG 
Leu 


CCA 
Pro 


GCA 
Ala 


ACA 
Thr 


1145 
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CAA ACT CAG TGT CAT GTG GCA AAC AAG ACC TGT CCA AAA AAT CAT GTC 
Gin Thr Gin Cys His Val Ala Asn Lys Thr Cys Pro Lys Asn His Val 
235 240 245 

TGG AAT AAT CAG ATT TGC AGA TGC TTA GCA CAG CAC GAT TTT GGT TTC 
Trp Asn Asn Gin He Cys Arg Cys Leu Ala Gin His Asp Phe Gly Phe 
250 255 260 

TCT TCT CAC CTT GGA GAT TCT GAC ACA TCT GAA GGA TTC CAT ATT TGT 
Ser Ser His Leu Gly Asp Ser Asp Thr Ser Glu Gly Phe His lie Cys 
265 270 275 

GGG CCC AAC AAA GAG CTG GAT GAA GAA ACC TGT CAA TGC GTC TGC AAA 
Gly Pro Asn Lys Glu Leu Asp Glu Glu Thr Cys Gin Cys Val Cys Lys 
280 285 290 295 

GGA GGT GTG CGG CCC ATA AGC TGT GGC CCT CAC AAA GAA CTA GAC AGO 
Gly Gly Val Arg Pro lie Ser Cys Gly Pro His Lys Glu Leu Asp Arg 
300 305 310 

GCA TCA TGT CAG ^C ATG TGC AAA AAC AAA CTG CTC CCC AGT TCC TGT 
Ala Ser Cys Gin Cys Met Cys Lys Asn Lys Leu Leu Pro Ser Ser Cys 
315 320 325 

GGG CCT AAC AAA GAA TTT GAT GAA GAA AAG TGC CAG TGT GTA TGT AAA 
Gly Pro Asn Lys Glu Phe Asp Glu Glu Lys Cys Gin Cys Val Cys Lys 
330 335 340 

AAG ACC TGT CCC AAA CAT CAT CCA CTA AAT CCT GCA AAA TGC ATC TGC 
Lys Thr Cys Pro Lys His His Pro Leu Asn Pro Ala Lys Cys He Cys 
345 350 3S5 

GAA TGT ACA GAA TCT CCC AAT AAA TGT TTC TTA AAA GGA AAA AGA TTT 
Glu Cys Thr Glu Ser Pro Asn Lys Cys Phe Leu Lys Gly Lys Arg Phe 
360 365 370 375 

CAT CAC CAG ACA TGC AGT TGT TAC AGA CCA CCA TGT ACA GTC CGA ACG 
His His Gin Thr Cys Ser Cys Tyr Arg Pro Pro Cys Thr Val Arg Thr 
380 385 390 

AAA CGC TGT GAT GCT GGA TTT CTG TTA GCT GAA GAA GTG TGC CGC TGT 
Lys Arg Cys Asp Ala Gly Phe Leu Leu Ala Glu Glu Val Cys Arg Cys 
395 400 405 

GTA CGC ACA TCT TGG AAA AGA CCA CTT ATG AAT TAAGCGAAGA AAGCACTACT 
Val Arg Thr Ser Trp Lys Arg Pro Leu Met Asn 
4X0 415 

CGCTATATAG TGTCG 



(2) INFORMATION FOR SEQ ID NO:53: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 418 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:53: 
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Met His Leu Leu Glu Met Leu Ser Leu Gly Cys Cys Leu Ala Ala Gly 
15 10 15 

Ala Val Leu Leu Gly Pro Arg Gin Pro Pro Val Ala Ala Ala Tyr Glu 
20 25 30 

Ser Gly His Gly Tyr Tyr Glu Glu Glu Pro Gly Ala Gly Glu Pro Lys 
35 40 45 
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Ala His Ala Ser Lys Asp lieu Glu Glu Gin Leu Arg Ser Val Ser Ser 
50 55 60 

Val Asp Glu Leu Met Thr Val Leu Tyr Pro Glu Tyr Trp Lys Met Phe 
65 70 75 & 80 

Lys Cys Gin Leu Arg Lys Gly Gly Trp Gin His Asn Arg Glu His Ser 
85 90 95 

Ser Ser Asp Thr Arg Ser Asp Asp Ser Leu Lys Phe Ala Ala Ala His 
100 105 110 

Tyr Asn Ala Glu lie Leu Lys Ser He Asp Thr Glu Trp Arg Lys Thr 
115 120 125 

Gin Gly Met Pro Arg Glu Val Cys Val Asp Leu Gly Lys Glu Phe Gly 
130 135 140 

Ala Thr Thr Asn Thr Phe Phe Lys Pro Pro Cys Val Ser He Tyr Arg 
145 150 155 160 

Cys Gly Gly Cys Cys Asn Ser Glu Gly Leu Gin Cys Met Asn He Ser 
165 170 175 

Thr Asn Tyr He Ser Lys Thr Leu Phe Glu He Thr Val Pro Leu Ser 
180 185 190 

His Gly Pro Lys Pro val Thr Val Ser Phe Ala Asn His Thr Ser Cys 
195 200 205 

Arg Cys Met Ser Lys Leu Asp Val Tyr Arg Gin Val His Ser He He 
210 215 220 

Arg Arg Ser Leu Pro Ala Thr Gin Thr Gin Cys His Val Ala Asn Lys 
225 230 235 240 

Thr Cys Pro Lys Asn His Val Trp Asn Asn Gin He Cys Arg Cys Leu 
245 250 255 

Ala Gin His Asp Phe Gly Phe Ser Ser His Leu Gly Asp Ser Asp Thr 
260 265 270 

Ser Glu Gly Phe His He Cys Gly Pro Asn Lys Glu Leu Asp Glu Glu 
275 280 285 

Thr Cys Gin Cys Val Cys Lys Gly Gly Val Arg Pro He Ser Cys Gly 
290 295 300 

Pro His Lys Glu Leu Asp Arg Ala Ser Cys Gin Cys Met Cys Lys Asn 
305 310 315 320 

Lys Leu Leu Pro Ser Ser Cys Gly Pro Asn Lys Glu Phe Asp Glu Glu 
325 330 335 

Lys Cys Gin Cys Val Cys Lys Lys Thr Cys Pro Lys His His Pro Leu 
340 345 350 

Asn Pro Ala Lys Cys He Cys Glu Cys Thr Glu Ser Pro Asn Lys Cys 
355 " 360 365 

Phe Leu Lys Gly Lys Arg Phe His His Gin Thr Cys Ser Cys Tyr Arg 
370 375 380 

Pro Pro Cys Thr Val Arg Thr Lys Arg Cys Asp Ala Gly Phe Leu Leu 
385 390 * 395 400 

Ala Glu Glu Val Cys Arg Cys Val Arg Thr Ser Trp Lys Arg Pro Leu 
405 410 415 

Met Asn 
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(2) INFORMATION FOR SEQ ID NO:54: 
(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 1582 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:54: 



20 



CTAGAGTTGA 


AC CAGAT AAG 


AAAGTCTCTT CTTCCGGTAA 


GATATTATGG 


ACCTATAACA 


60 


TCTGTG TACT 


TAAAAGTAGA TTGGGAGTGA AAGGCAGACT 


TTTGATGTTC 


TGTACACTG T 


120 


TGAAACCCCT 


TAGCGTGGTC 


CTCTGTAACC TGCTCACCCT 


GCCCCAAGGA 


GGCAGCTAGC 


180 


CAATGCCACC 


AGCCCAACGG 


AAACCCCAGT GCTTTTCCAA 


TGGGGAAATG 


CAGTCACTTT 


240 


TCTTTGGATG 


CTACACATCC 


TTTCTGGAAT ATGTCTCACA 


CACATCTCTC 


TTTATCACCC 


300 


CCTTTTTCAA 


GTAAACCAAC 


TTCTTGCAGA AGCTGACAAT 


GTGTCTCTTT 


ACTCTCCACG 


360 


AAGATTCTGG 


CCCTTCTCTT 


CACCTGTCAG AAGTTTAGGA 


TTCCAAAGGG 


ATCATTAGCA 


420 


TCCATCCCAA 


CAGCCTGCAC 


TGCATCCTGA GAACTGCGGT 


TCTTGGATCA 


TCAGGCAACT 


480 


TTCAACTACA 


CAGACCAAGG 


GAGAGAGGGG ACCCCTCCGA 


GGTCCCATAG 


GGTTCTCTGA 


540 


CATAGTGATG 


ACCTTTTTCC 


AAACTTTGAG CAGGGCGCTG 


GGGGCCAGGC 


GTGCGGGAGG 


600 


GAGGACAAGA 


ACTCGGGAGT 


GGCCGAGGAT AAAGCGGGGG 


CTCCCTCCAC 


CCCACGGTGC 


660 


CCAGTTTCTC 


CCCGCTGCAC 


GTGGTCCAGG GTGGTCGCAT 


CACCTCTAAA 


GCCGGTCCCG 


720 


CCAACCGCCA 


GCCCCGGGAC 


TGAACTTGCC CCTCCGGCCG 


CCCGCTCCCC 


GCAGGGGACA 


780 


GGGGCGGGGA 


GGGAGAGATC 


CAGAGGGGGG CTGGGGGAGG 


TGGGNCCGCC 


GGGGAGGAGN 


640 


CGAGGGAAAC 


GGGG AG CTCC 


AGGGAGACGG CTTCCGAGGG 


AGAGTGAGAG 


GGGAGGGCAG 


900 


CCCGGGCTCG 


GCACGCTCCC 


TCCCTCGGCC GCTTTCTCTC 


ACATAAGCGC 


AGGCAGAGGG 


960 


CGCGTCAGTC 


ATGCCCTGCC 


CCTGCGCCCG CCGCCGCCGC 


CGCC6CCGCT 


CAGCCCGGCG 


1020 


CGCTCTGGAG 


GATCCTGCGC 


CGCGGCGCTC CCGGGCCCCG 


CCGCCGCCAG 


CCGCCCCGGC 


1080 


GGCCCTCCTC 


CCGCCCCCGG 


CACCGCCGCC AGCGCCCCCG CCGCAGCGCC 


CGCGGCCCGG 


1140 


CTCCTCTCAC 


TTCGGGGAAG 


GGGAGGGAGG AGGGGGACGA GGGCTCTGGC 


GGGTTTGGAG 


1200 


GGGCTGAACA 


TCGCGGGGTG 


TTCTGGTGTC CCCCGCOCCG 


CCTCTCCAAA 


AAGCTACACC 


1260 


GACGCGGACC 


GCGGCGGCGT 


CCTCCCTCGC CCTCGCTTCA 


CCTCGCGGGC 


TCCGAATGCG 


1320 


GGGAGCTCGG 


ATGTCCGGTT 


TCCTGTGAGG CTTTTACCTG ACACCCGCCG 


CCTTTCCCCG 


1380 


GCACTGGCTG 


GGAGGGCGCC 


CTGCAAAGTT GGGAACGCGG AGCCCCGGAC 


CCGCTCCCGC 


1440 


CGCCTCCGGC 


TCGCCCAGGG 


GGGGTCGCCG GGAGGAGCCC GGGGGAGAGG 


GACCAGGAGG 


1500 


GGCCCGCGGC 


CTCGCAGGGG 


CGCCCGCGCC CCCACCCCTG 


CCCCCGCCAG 


CGGACCGGTC 


1560 


ccccaccccc 


GGTCCTTCCA CC 






1582 
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(2) INFORMATION FOR SEQ ID NO:55: 
(i) SEQUENCE CHARACTERISTICS: 

5 (A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

10 (ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:55 : 



1 5 CACGGCTTAT . GCAAGCAAAG 2 O 

(2) INFORMATION FOR SEQ ID NO:56: 

(i) SEQUENCE CHARACTERISTICS: 

20 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

25 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:56: 

30 

AACACAQTTT TCCATAATAG 2 O 

(2) INFORMATION FOR SEQ ID NO:57: 
35 (j) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
40 (D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:57: 

45 

GCCACGGTAG GTCTGCGT 18 

(2) INFORMATION FOR SEQ ID NO:58 : 

50 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

55 (C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 
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(xi) SEQUENCE DESCRIPTION: SEQ ID NO:58: 

TTTCTTTGAC AGGCTTAT 

(2) INFORMATION FOR SEQ ID NO:59: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:59: 

ATCTTGAAAA GTAAGTATGG G 

(2) INFORMATION FOR SEQ ID NO:60: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:60: 

ATGACTTQAC AGGTATTGAT 

(2) INFORMATION FOR SEQ ID NO:61: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:61: 

AGCAAGACGG TGGGTATTGT 

(2) INFORMATION FOR SEQ ID NO:62: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:62: 



CCCTTCTTTG TAGTTA.TTTQ AA 

(2) INFORMATION FOR SEQ ID NO:63: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:63: 



C CACAGTGAG TATGAATTAA 

(2) INFORMATION FOR SEQ ID NO:64: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:64: 



TTCTTCCAAA GGTGTCAG 



(2) INFORMATION FOR SEQ ID NO:65: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 18 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 65: 
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GGAGATGGTA GCAGAATG 

(2) INFORMATION FOR SEQ ID NO:66: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:66: 

CTATTTGTCT AGACTCAACA GAT 

(2) INFORMATION FOR SEQ ID NO:67: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:67: 

CAAACATGCA GGTAAGAGAT CC 

(2) INFORMATION FOR SEQ ID NO:68: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO-68: 

TGTTCTCCTA GCTGTTACAG A 

(2) INFORMATION FOR SEQ ID NO:69: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 
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(D) TOPOLOGY: linear 
(ii) MOLECULE TYPE: DNA 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO:69: 

GGCGAGGTCA AGGTAGGTGC AAGG 

(2) INFORMATION FOR SEQ ID NO:70: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:70: 

ATTGTCTTTG ACAGGCTTTT TGAAGG 

(2) INFORMATION FOR SEQ ID NO: 71 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 21 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:71: 

GAGATCCTGA AAAGTAAGTA G 

(2) INFORMATION FOR SEQ ID NO: 72: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:72: 

TGTGACTCGA CAGGTATTGA TAAT 
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(2) INFORMATION FOR SEQ ID NO:73: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:73: 



CTCAGCAAGA CGGTAGGTAT 

(2) INFORMATION FOR SEQ ID NO: 74: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 25 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:74: 



TTGTCCCTTG T AGTTG TTTG AAATT 



(2) INFORMATION FOR SEQ ID NO:75: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 20 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:75: 



ACATTACCAC AGTGAGTATG 

(2) INFORMATION FOR SEQ ID NO:76: 
(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 26 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:76: 

GTCTCCCCAA AAGGTGTCAG GCAGCT 

(2) INFORMATION FOR SEQ ID NO.77: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 23 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:77: 

AATGTTGAAG ATGGTAAGTA AAA 

(2) INFORMATION FOR SEQ ID NO:78: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 16 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 78: 

TCTAGACTCA ACCAAT 

(2) INFORMATION FOR SEQ ID NO:79: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 22 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:79: 

CAAACATGCA GGTAAGGAGT GT 

(2) INFORMATION FOR SEQ ID NO:80: 
(i) SEQUENCE CHARACTERISTICS: 
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(A) LENGTH: 24 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:80: 



TTTTCCCCTA GTTGTTACAG AAGA 24 



(2) INFORMATION FOR SEQ ID NO:81 : 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 19 amino acids 

(B) TYPE: amino acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:81: 



Leu Ser Lys Thr Val Ser Gly Ser Glu Gin Asp Leu Pro His Glu Leu 
15 10 15 

His Val Glu 



Claims 



1. A purified and isolated polypeptide that is capable of binding to Flt4 receptor tyrosine kinase (Flt4), with an amino 
acid sequence comprising a portion of amino acid sequence 103-419 of SEQ ID NO:33, wherein said portion is 
capable of binding to Flt4 and said polypeptide does not include an amino acid sequence corresponding to amino 
acids 94-41 9 of SEQ I D NO:33 in which a Gin residue has been substituted for the Lys residue at position 41 4 in 
SEQ ID NO:33. 

2. A polypeptide as claimed in claim 1 , wherein said portion is a portion of the amino acid sequence encoded by the 
coding insert of plasmid pFLT4-L (deposited under ATCC accession number 97231). 

3. A polypeptide as claimed in claim 1 or claim 2, wherein the portion is a portion of amino acid sequence 103-282 
of SEQ ID NO:33. 

4. A polypeptide as claimed in claim 1 , or claim 2, that lacks all of amino acid sequence 1 -1 02 of SEQ I D NO: 33 and 
all of amino acid sequence 227, 228 or 283-419 of SEQ ID NO:33. 

5. A polypeptide as claimed in any of claims 1-4, consisting of all or a portion of the amino acid sequence encoded 
by the coding insert of plasmid pFLT4-L (deposited under ATCC accession number 97231). 

6. A polypeptide as claimed in any of claims 1-4, with an amino acid sequence consisting of all or a portion of SEQ 
ID NO:33. 

7. A polypeptide as claimed in claim 1, 2 or 3, with an amino acid sequence comprising the amino acid sequence 
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encoded by the coding insert of plasmid pFLT4-L (deposited under ATCC accession number 97231 ), or the amino 
acid sequence of SEQ ID NO: 33. 

8. A polypeptide as claimed in any of the preceding claims with an amino acid sequence including or consisting of 
amino acid residues 31-213, 31-227, 31-419, 32-419, 32-227, 103-217, 103-225, 103-227, 103-419, 104-213, 
104-227, 112-213, 112-227, 113-211, 113-213, 113-227, 131-211 or 161-211 of SEQ ID NO:33. 

9. A polypeptide as claimed in claim 8 with an amino acid sequence including or consisting of amino acid residues: 
103-227, 104-213, 113-213, 131-221 or 161-211 of SEQ ID NO:33. 

10. A polypeptide as claimed in any of the preceding claims, with an amino acid sequence including or consisting of 
amino acid residues 1 03-1 1 2 to 226-227, 31 -1 61 to 21 1 -227 or 1 03-1 31 to 21 1 -227 of SEQ I D NO:33. 

11. A polypeptide as claimed in any of claims 1, 3 and 6, wherein the portion of SEQ ID NO:33 includes 8 cysteine 
residues that are conserved in human vascular endothelial growth factor (VEGF), human platelet derived growth 
factor A (PDGF-A), and human platelet derived growth factor B (PDGF-B). 

12. A polypeptide as claimed in any of claims 1-7, with an amino acid sequence comprising or consisting of amino 
acid residues 103-282, 1 03-227 or 1 03-223 of SEQ ID NO: 33, or a fragment of one of said sequences which binds 
to Flt4. 

13. A polypeptide as claimed in any of claims 1-12 having an apparent molecular weight of about 21-23 or 29-32 kD 
as assessed by SDS-PAGE under reducing conditions. 

14. A polypeptide as claimed in any of the preceding claims, said polypeptide being capable of binding to the extra 
cellular domain of Flt4 receptor tyrosine kinase (Flt4), having an apparent molecular weight of approximately 23kD, 
as determined by SDS-PAGE under reducing conditions, and being purifyable from conditioned media from a PC- 
3 prostatic adenocarcinoma cell line, said cell line having ATCC accession number CRL 1435, using an affinity 
purification procedure wherein the affinity purification matrix comprises a polypeptide comprising the extra cellular 
domain of Flt4. 

15. A polypeptide as claimed in any of the preceding claims, comprising an amino terminal amino acid sequence as 
set forth in SEQ ID NO: 13. 

16. A polypeptide as claimed in any of the preceding claims, capable of binding to and inducing tyrosine autophos- 
phorylation of the VEGFR-2 receptor. 

17. A polypeptide as claimed in any of claims 1-16, wherein said polypeptide is capable of stimulating tyrosine phos- 
phorylation of Flt4 in a host cell expressing Flt4. 

18. A polypeptide as claimed in any of claims 1 -1 6, wherein said polypeptide is capable of binding to the extra cellular 
domain of Flt4. 

19. A polypeptide as claimed in any of claims 1 -1 8 further comprising a detectable label. 

20. A purified protein comprising a first polypeptide linked to a second polypeptide, wherein at least one of said first 
polypeptide and said second polypeptide is a polypeptide according to any of claims 1 -1 9, and wherein said protein 
is capable of binding to Flt4 receptor tyrosine kinase (Flt4). 

21. A protein as claimed in claim 20 wherein said first polypeptide is covalently linked to said second polypeptide. 

22. A protein comprising a first polypeptide linked to a second polypeptide, wherein each of said first polypeptide and 
said second polypeptide is a polypeptide according to any one of claims 1 -1 9. 

23. A pharmaceutical composition comprising a polypeptide as claimed in any of claims 1 -1 9, or a protein as claimed 
in any of claims 20-22, in a pharmaceutically-acceptable diluent, adjuvant, excipient, or carrier. 

24. A purified and isolated nucleic acid molecule comprising or consisting of a nucleotide sequence encoding a 
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polypeptide as claimed in any of claims 1 -1 9, wherein said nucleic acid molecule is not a cDNA molecule with the 
DNA sequence set out in entry HS991157, accession number H07991 .1 , in the EMBL data base. 

25. A nucleic acid molecule as claimed in claim 24, with a nucleotide sequence comprising or consisting of acontiguous 
5 sequence of nucleotides from nucleotide sequence 352-1 608 of SEQ ID NO:32. 

26. A nucleic acid molecule as claimed in claim 24, comprising or consisting of a VEGF-C encoding insert of plasmid 
pFLT4-L, deposited under ATCC accession number 97231. 

10 27. A vector comprising a nucleic acid molecule as claimed in any of claims 24-26. 

28. A vector as claimed in claim 27, with a coding insert comprising or consisting of a nucleic acid molecule as claimed 
in any of claims 24-26. 

15 29. A vector as claimed in claim 27, said vector being plasmid pFI_T4-L, deposited under ATCC accession number 
97231. 

30. A host cell transfected or transformed with a vector as claimed in any of claims 27-29. 

20 31. A method of making a polypeptide or protein capable of specifically binding to Flt4, said method comprising the 
steps of: 

a) expressing a polypeptide as claimed in any of claims 1 -1 9, or a protein as claimed in any of claims 20-22, 
in a host cell, optionally as claimed in claim 30, and 
25 b) purifying a polypeptide capable of specifically binding to Flt4 from said host cell or from a growth medium 

of said host cell. 

32. A method as claimed in claim 31 comprising the steps of: 

30 growing a host cell comprising a polynucleotide including a nucleotide sequence that encodes the amino acid 

sequence setforth in SEQ ID NO: 33, underconditions in which the host cell expresses and secretes a polypep- 
tide encoded by the polynucleotide; and 

isolating a polypeptide as claimed in any of claims 1-19, or a protein as claimed in any of claims 20-22, from 
the growth medium of the host cell. 

35 

33. A method as claimed in claim 32, wherein the polypeptide isolated from the growth medium of the host cell has a 
molecular weight of approximately 23kD, as assessed by SDS-PAGE under reducing conditions, and an amino 
acid sequence comprising a portion of SEQ ID NO: 33 effective to bind Flt4 extracellular domain. 

40 34. a method as claimed in claim 32 or 33, wherein said polynucleotide comprises an expression vector that comprises 
a nucleotide sequence that encodes the amino acid sequence set forth in SFQ ID NO: 33. 

35. A method as claimed in claim 32, 33 or 34 wherein said host cell comprises a PC-3 prostatic adenocarcinoma cell 
(ATCC accession number CRL 1435). 

45 

36. A method as claimed in claim 32, wherein said polynucleotide comprises the coding insert of plasmid pFLT4-L 
deposited as under ATCC accession number 97231 . 

37. A method for detecting endothelial cells in a biological tissue comprising the steps of: 

50 

exposing a biological tissue comprising endothelial cells to a polypeptide as claimed in any of claims 1 -1 9, or 
a protein as claimed in any of claims 20-22, under conditions wherein said polypeptide or protein binds to 
endothelial cells; and 

detecting said polypeptide or protein bound to endothelial cells in said biological tissue, thereby detecting said 
55 endothelial cells. 

38. A method as claimed in 37, further comprising step of washing said biological tissue, said washing step being 
performed after said exposing step and before said detecting step. 
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39. A method of modulating the growth of mammalian endothelial cells comprising the steps of: 

exposing mammalian endothelial cells to a polypeptide as claimed in any of claims 1 -1 9, or protein as claimed 
in any of claims 20-22, in an amount effective to modulate the growth of mammalian endothelial cells. 

40. An antibody which is specifically reactive with a polypeptide as claimed in any of claims 1-1 9. 

41. An antibody as claimed in claim 40 which is a monoclonal antibody. 

42. An antibody as claimed in claim 40 or 41 which specifically binds to a peptide with an amino acid sequence con- 
sisting of the amino acid sequence set fort in SEQ ID NO: 39. 

43. An antiserum produced by a process comprising the steps of: 

immunising a mammal with a peptide with an amino acid sequence consisting of the amino acid sequence set 
fort in SEQ ID NO: 39; and 

obtaining antiserum from said mammal after said immunising step, said antiserum containing antibodies as 
claimed in claim 40 or 42. 

44. An antiserum as claimed in claim 43 wherein the mammal is immunised multiple times prior to obtaining antiserum 
from said mammal. 

45. A polypeptide as claimed in any of claims 1 -1 9, or a protein as claimed in any of claims 20-22, or a pharmaceutical 
composition as claimed in claim 23, for use in a diagnostic or clinical application. 

46. A polypeptide, protein or pharmaceutical composition as claimed in claim 45, wherein said diagnostic or clinical 
application involves angiogenesis acceleration or induction, the quantification of future metastatic risk, the promo- 
tion of re-growth or permeability of lymphatic vessels, the mitigation of loss of auxiliary lymphatic vessels following 
surgical interventions in the treatment of cancer, the treatment or prevention of inflammation, edema, aplasia of 
the lymphatic vessels, lymphatic obstruction, elephantiasis and milroy's disease, stimulation of lymphocyte pro- 
duction and maturation, the promotion or inhibition of trafficking of leukocytes between tissues and lymphatic ves- 
sels, or the migration in and out of the thymus, screening for an endothelial cell disorder, optionally a lymphatic 
vessel disorder, modulating the growth of endothelial cells in an mammalian subject, modulating endothelial cell- 
related disorders including physical loss of lymphatic vessels, lymphatic vessel occlusion and lymphangiomas. 

47. A polypeptide, protein or pharmaceutical composition as claimed in claim 45 or 46, wherein the subject of the 
diagnostic or clinical application is human. 

48. Use of a polypeptide as claimed in any of claims 1-19, a protein as claimed in any of claims 20-22 or a pharma- 
ceutical composition as claimed in claim 23 for the manufacture of a medicament for use in a diagnostic or clinical 
application as defined in claim 46. 



Patentanspruche 

1. Gereinigtes und isoliertes Polypeptid, das in der Lage ist, an Flt4-Rezeptortyrosinkinase (Flt4) zu binden, mit einer 
Aminosauresequenz umfassend einen Abschnitt der Aminosauresequenz 103-419 von SEQ ID NO:33, wobei 
der Abschnitt in der Lage ist, an Flt4 zu binden und das Polypeptide keine Aminosauresequenz entsprechend den 
Aminosauren 94 - 41 9 von SEQ ID NO:33 aufweist, in der der Lys-Rest durch einen Gin-Rest an Position 414 in 
SEQ ID NO:33 ersetzt worden ist. 

2. Polypeptid nach Anspruch 1 , wobei der Abschnitt ein Abschnitt der Aminosauresequenz ist, die durch das codie- 
rende Insert von Plasmid pFLT4-L (hinterlegt unter der ATCC-Zugangsnummer 97231) codiert ist. 

3. Polypeptide nach Anspruch 1 oder 2, wobei der Abschnitt ein Abschnitt von der Aminosauresequenz 103 - 282 
von SEQ ID NO:33 ist. 

4. Polypeptid nach Anspruch 1 oder Anspruch 2, dem die ganze Aminosauresequenz 1-102 von SEQ ID NO:33 



81 



EP 0 842 273 B1 



und die ganze Aminosauresequenz 227, 228 oder 283-419 von SEQ ID NO:33 fehlt. 

5. Polypeptid nach einem der Anspruche 1 bis 4 bestehend aus der ganzen oder einem Abschnitt der Aminosaure- 
sequenz, die durch das codierende Insert von Plasmid pFLT4-L (hinterlegt unter der ATCC-Zugangsnummer 

5 97231) codiert ist. 

6. Polypeptid nach einem der Anspruche 1 bis 4 mit einer Aminosauresequenz, die aus der ganzen oder einem 
Abschnitt der SEQ ID NO:33 besteht. 

10 7. Polypeptid nach Anspruch 1, 2 oder 3 mit einer Aminosauresequenz, die die Aminosauresequenz umfasst, die 
durch das codierende Insert von Plasmid pFLT4-L (hinterlegt unter der ATCC-Zugangsnummer 97231 ) codiert ist, 
oder der Aminosauresequenz von SEQ ID NO:33. 

8. Polypeptid nach einem der vorangehenden Anspruche mit einer Aminosauresequenz. umfassend die Aminosau- 
15 rereste oder bestehend aus den Aminosaureresten 31 -213,31 - 227, 31 - 41 9, 32 - 41 9, 32 - 227, 1 03 - 21 7, 1 03 

-225, 103-227, 103-419, 104-213, 104-227, 112-213, 112-227, 113-211, 113-213, 113-227, 131 -211 oder 
161-211 vonSEQID NO:33. 

9. Polypeptid nach Anspruch 8 mit einer Aminosauresequenz umfassend die Aminosaurereste oder bestehend aus 
20 den Aminosaureresten: 103 -227, 104 -213, 113 -213, 131 - 221 oder 161-211 von SEQ IID NO:33. 

10. Polypeptid nach einem der vorangehenden Anspruche mit einer Aminosauresequenz umfassend die Aminosau- 
rereste oder bestehend aus den Aminosaureresten 1 03 - 1 1 2 bis 226 - 227, 31 - 1 61 bis 21 1 - 227 oder 1 03 - 1 31 
bis 211 - 227 von SEQ ID NO:33. 

25 

11. Polypeptid nach Anspruch 1, 3 und 6, wobei der Abschnitt von SEQ ID NO:33 8 Cysteinreste umfasst, die in 
menschlichem Vaskularendothelwachstumsfaktor (VEGF), menschlichem aus Plattchen gewonnenen Wachs- 
tumsfaktor A (PDGF-A) und menschlichem aus Plattchen gewonnenen Wachstumsfaktor B (PDGF-B) konserviert 
sind. 

30 

12. Polypeptid nach einem der Anspruche 1 bis 7 mit einer Aminosauresequenz umfassend Aminosaurereste oder 
bestehend aus Aminosaureresten 1 03 - 282, 1 03 - 227 oder 1 03 - 223 von SEQ ID NO:33, oder ein Fragment von 
einer der Sequenzen, das an Flt4 bindet. 

35 13. Polypeptid nach einem der Anspruche 1 bis 12 mit einem scheinbaren Molekulargewicht von etwa 21 - 23 oder 
29 - 32 kD, wie durch SDS-PAGE unter reduzierenden Bedingungen bestimmt. 

14. Polypeptid nach einem der vorangehenden Anspruche, wobei das Polypeptid in der Lage ist, an die extrazellulare 
Domane der Flt4-Rezeptortyrosinkinase (Flt4) zu binden, ein scheinbares Molekulargewicht von etwa 23 kD auf- 
40 weist, wie durch SDS-PAGE unter reduzierenden Bedingungen bestimmt, und gereinigt werden kann aus kondi- 

tioniertem Medium aus einer PC-3-Prostataadenokarzinomzelllinie, wobei die Zelllinie die ATCC-Zugangsnummer 
CRL 1435 aufweist, unter Verwendung eines Affinitatsreinigungsverfahrens, wobei die Affinitatsreinieungsmatrix 
ein Polypeptid umfasst, das die extrazellulare Domane von Flt4 umfasst. 

45 15. Polypeptid nach einem der vorangehenden Anspruche umfassend eine Aminoterminale Aminosauresequenz, wie 
in SEQ ID NO:13 angegeben. 

16. Polypeptid nach einem der vorangehenden Anspruche, das in der Lage ist, an den VEGFR-2-Rezeptor zu binden 
und die Tyrosinautophosphorylierung von VEGFR-2-Rezeptor zu induzieren. 

50 

17. Polypeptid nach einem der Anspruche 1 bis 1 6, wobei das Polypeptid in der Lage ist, die Tyrosinphosphorylierung 
von Flt4 in einer Wirtszelle zu stimulieren, die Flt4 exprimiert. 

18. Polypeptid nach einem der Anspruche 1 bis 1 6, wobei das Polypeptid in der Lage ist, an die extrazellulare Domane 
55 von Flt4 zu binden. 

19. Polypeptid nach einem der Anspruche 1 bis 18 weiter umfassend eine nachweisbare Markierung. 
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20. Gereinigtes Protein umfassend ein erstes Polypeptid, das mit einem zweiten Polypeptid verbunden ist, wobei 
wenigstens das erste Polypeptid oder das zweite Polypeptid ein Polypeptid gemaG einem der Anspruche 1 bis 1 9 
ist und wobei das Protein in der Lage ist, an Flt4-Rezeptortyrosinkinase (Flt4) zu binden. 

21. Protein nach Anspruch 20, wobei das erste Polypeptid kovalent mit dem zweiten Polypeptide verbunden ist. 

22. Protein umfassend ein erstes Polypeptid, das mit einem zweiten Polypeptid verbunden ist, wobei sowohl das erste 
Polypeptid als auch das zweite Polypeptid ein Polypeptid gemaG einem der Anspruche 1 bis 19 ist. 

23. Pharmazeutische Zusammensetzung umfassend ein Polypeptid nach einem der Anspruche 1 bis 19 oder ein 
Protein nach einem der Anspruche 20 bis 23, in einem pharmazeutisch akzeptablen Verdunnungsmittel, Adjuvans, 
Bindemittel oder Trager. 

24. Gereinigtes und isoliertes Nukleinsauremolekul umfassend oder bestehend aus einer Nukleotidsequenz kodierend 
fur ein Polypeptid nach einem der Anspruche 1 bis 19, wobei das Nukleinsauremolekul kein cDNA-Molekul mit 
der DNA-Sequenz ist, die im Eintrag HS991 1 57, Zugangsnummer H 07991 .1 , in der EMBL-Datenbank angegeben 
ist. 

25. Nukleinsauremolekul nach Anspruch 24 mit einer Nukleotidsequenz umfassend oder bestehend aus einer Se- 
quenz aus benachbarten Nukleotiden von Nukleotidsequenz 352 - 1608 von SEQ ID NO:32. 

26. Nukleinsauremolekul nach Anspruch 24, umfassend oder bestehend aus einem VEGF-C codierenden Insert aus 
Plasmid pFI_T4-L, das unter der ATCC-Zugangsnummer 97231 hinterlegt ist. 

27. Vektor umfassend ein Nukleinsauremolekul nach einem der Anspruche 24 bis 26. 

28. Vektor nach Anspruch 27 mit einem kodierenden Insert umfassend oder bestehend aus einem Nukleinsauremo- 
lekul nach einem der Anspruche 24 bis 26. 

29. Vektor nach Anspruch 27, wobei der Vektor das Plasmid pFLT4-L ist, das unter der ATCC-Zugangsnummer 97231 
hinterlegt ist. 

30. Wirtszelle, die mit einem Vektor nach einem der Anspruche 27 bis 29 transfiziert oder transformiert ist. 

31. Verfahren zum Herstellen eines Polypeptids oder Proteins, das in der Lage ist, spezifisch an Flt4 zu binden, wobei 
das Verfahren die Schritte umfasst: 

a) Exprimieren eines Polypeptids nach einem der Anspruche 1 bis 19, oder eines Proteins nach einem der 
Anspruche 20 bis 22, in einer Wirtszelle, optional wie beansprucht in Anspruch 30, und 

b) Reinigen eines Polypeptids, das in der Lage ist, spezifisch an Flt4 aus der Wirtszelle oder aus einem Wachs- 
tumsmedium der Wirtszelle zu binden. 

32. Verfahren nach Anspruch 31 umfassend die Schritte 

Anziehen einer Wirtszelle umfassend ein Polynukleotid umfassend eine Nukleotidsequenz, die die Amino- 
sauresequenz codiert, die in SEQ ID NO:33 angegeben ist, unter Bedingungen, unter denen die Wirtszelle ein 
Polypeptid exprimiert und sekretiert, das durch das Polynukleotid kodiert ist; und 

Isolieren eines Polypeptids nach einem der Anspruche 1 bis 19, oder eines Proteins nach einem der An- 
spruche 20 bis 22, aus dem Wachstumsmedium der Wirtszelle. 

33. Verfahren nach Anspruch 32, wobei das aus dem Wachstumsmedium der Wirtszelle isolierte Polypeptid ein Mo- 
lekulargewicht von etwa 23 kD, wie bestimmt durch SDS-PAGE unter reduzierenden Bedingungen, und eine Ami- 
nosauresequenz umfassend einen Abschnitt von SEQ ID NO:33 aufweist, derwirksam ist, urn an die extrazellulare 
Domane von Flt4 zu binden. 

34. Verfahren nach Anspruch 32 oder 33, wobei das Polynukleotid einen Expressionsvektor umfasst, der eine Nu- 
kleotidsequenz umfasst, die die in SEQ ID NO:33 angegebene Aminosauresequenz codiert. 

35. Verfahren nach Anspruch 32, 33 oder 34, wobei die Wirtszelle eine PC-3-Prostataadenokarzinomzeile (ATCC-Zu- 
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gangsnummer CRL 1435) umfasst. 

36. Verfahren nach Anspruch 32, wobei das Polynukleotid das codierende Insert von Plasmid pFLT4-L umfasst, das 
unter der ATCC-Zugangsnummer 97231 hinterlegt ist. 

37. Verfahren zum Nachweis von Endothelzellen in einem biologischen Gewebe umfassend die Schritte: 

Exponieren eines biologischen Gewebes, das Endothelzellen umfasst, gegenuber einem Polypeptid nach 
einem der Anspruche 1 bis 1 9, oder einem Protein nach einem der Anspruche 20 bis 22, unter Bedingungen, 
unter denen das Polypeptid oder Protein an Endothelzellen bindet; und 

Nachweisen des an Endothelzellen in dem biologischen Gewebe gebundenen Polypeptids oder Proteins, 
wodurch die Endothelzellen nachgewiesen werden. 

38. Verfahren nach Anspruch 37, weiter umfassend den Schritt des Waschens des biologischen Gewebes, wobei der 
Waschschritt nach dem Expositionsschritt und vor dem Nachweisschritt durchgefuhrt wird. 

39. Verfahren zum Modulieren des Wachstums von Saugerendothelzellen umfassend die Schritte: 

Exponieren von Saugerendothelzellen gegenuber einem Polypeptid nach einem der Anspruche 1 bis 1 9, oder 
einem Protein nach einem der Anspruche 20 bis 22, in einer Menge, die wirksam ist, urn das Wachstum von 
Saugerendothelzellen zu modulieren. 

40. Antikorper, der spezifisch mit einem Polypeptid nach einem der Anspruche 1 bis 19 reaktiv ist. 

41. Antikorper nach Anspruch 40, der ein monoklonaler Antikorper ist. 

42. Antikorper nach Anspruch 40 oder 41 , der spezifisch an ein Peptid mit einer Aminosauresequenz bestehend aus 
der Aminosauresequenz, die in SEQ ID NO:39 angegeben ist, bindet. 

43. Antiserum, hergestellt durch ein Verfahren umfassend die Schritte: 

Immunisieren eines Saugetiers mit einem Peptid mit einer Aminosauresequenz bestehend aus der Amino- 
sauresequenz, die in SEQ ID NO:39 angegeben ist; und 

Erhalten von Antiserum von dem Saugetier nach dem Immunisierungsschritt, wobei das Antiserum Antikorper 
nach Anspruch 40 oder 42 enthalt. 

44. Antiserum nach Anspruch 43, wobei das Saugetier mehrfach vor dem Erhalten des Antiserums von dem Saugetier 
immunisiert wird. 

45. Polypeptid nach Anspruch 1 bis 1 9, oder Protein nach Anspruch 20 bis 22, oder pharmazeutische Zusammenset- 
zung nach Anspruch 23, zur Verwendung bei einer diagnostischen oder klinischen Anwendung. 

46. Polypeptid, Protein oder pharmazeutische Zusammensetzung nach Anspruch 45, wobei die diagnostische oder 
klinische Anwendung Angiogenesebeschleunigung oder Angiogeneseinduktion, die Quantifizierung von zukunfti- 
gem Metastaserisiko, die Forderung von erneutem Wachstum oder Permeabilitat von lymphatischen GefaGen, die 
Linderung der Verlustes von lymphatischen HilfsgefaGen nach chirurgischen Interventionen bei der Behandlung 
von Krebs, die Behandlung oder Prevention von Entzundung, Odem, Aplasie der lymphpatischen GefaGe, lym- 
phatische Obstruktion, Elefantiasis und Milroyschem Syndrom, Stimulierung von Lymphozytenproduktion und 
-Reifung, die Forderung oder Inhibierung des Leitens von Leukozyten zwischen Geweben und lymphatischen 
GefaGen, oder die Wanderung in und aus dem Thymus, Screenen auf Endothelzellstorungen, optional eine Lymph- 
gefaGstorung, Modulieren des Wachstums von Endothelzellen in einem Saugetier, Modulieren von mit Endothel- 
zellen im Zusammenhang stehenden Erkrankungen einschlieGlich physikalischer Verlust von lymphatischen Ge- 
faGen, LymphgefaGokklusion und Lymphangiomen umfasst. 

47. Polypeptid, Protein oder pharmazeutische Zusammensetzung nach Anspruch 45 oder 46, wobei das Lebewesen 
der diagnostischen oder klinischen Anwendung ein Mensch ist. 
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48. Verwendung eines Polypeptids nach einem der Anspruche 1 bis 1 9, eines Proteins nach einem der Anspruche 20 
bis 22 oder einer pharmazeutischen Zusammensetzung nach Anspruch 23 zur Herstellung eines Medikaments 
zur Verwendung bei einer diagnostischen oder klinischen Anwendung, wie in Anspruch 46 definiert. 

5 

Revendications 

1. Polypeptide purifie et isole qui est capable de lier le recepteur Flt4 tyrosine kinase (Flt4), avec une sequence 
d'acides amines comprenant une partie de la sequence d'acides amines 1 03-41 9 de SEQ ID NO: 33, dans lequel 

10 ladite partie est capable de se lier a Flt4 et ledit polypeptide n'inclut pas une sequence d'acides amines corres- 

pondant aux acides amines 94-419 de SEQ ID NO: 33 dans laquelle un residu Gin a ete substitue a la place du 
residu Lys en position 414 de la SEQ ID NO: 33. 

2. Polypeptide selon la revendication 1 , dans lequel ladite partie est une partie de la sequence d'acides amines codee 
15 par I'insert codant du plasmide pFLT4-L (depose sous le numero d'acces ATCC 97231). 

3. Polypeptide selon la revendication 1 ou la revendication 2, dans lequel la partie est une partie de la sequence 
d'acides amines 103-282 de SEQ ID NO: 33. 

20 4. Polypeptide selon la revendication 1 ou la revendication 2, qui a perdu toute la sequence d'acides amines 1-102 
de SEQ ID NO: 33 et toute la sequence d'acides amines 227, 228 ou 283-419 de SEQ ID NO: 33. 

5. Polypeptide selon Tune quelconque des revendications 1 a 4, constitue de tout ou partie de la sequence d'acides 
amines codee par I'insert codant du plasmide pFLT4-L (depose sous le numero d'acces ATCC 97231). 

25 

6. Polypeptide selon Tune quelconque des revendications 1 a 4, avec une sequence d'acides amines constitute de 
tout ou partie de SEQ ID NO: 33. 

7. Polypeptide selon la revendication 1 , 2 ou 3, avec une sequence d'acides amines comprenant la sequence d'acides 
30 amines codee par I'insert codant du plasmide pFLT4-L (depose sous le numero d'acces ATCC 97231), ou la se- 
quence d'acides amines de SEQ ID NO:33. 

8. Polypeptide selon I'une quelconque des revendications precedentes avec une sequence d'acides amines incluant 
les ou constitute des residus d'acides amines 31-213, 31-227, 31-419, 32-419, 32-227, 103-217, 103-225, 

35 1 03-227, 1 03-41 9, 1 04-21 3, 1 04-227, 1 1 2-213, 1 1 2-227, 1 1 3-21 1,11 3-21 3, 1 1 3-227, 1 31 -21 1 ou 1 61 -21 1 de SEQ 

ID NO:33. 

9. Polypeptide selon la revendication 8, avec une sequence d'acides amines incluant les ou constitute des residus 
d'acides amines: 103-227, 104-213, 113-213,131-221 ou 161-211 de SEQ ID NO: 33. 

40 

10. Polypeptide selon I'une quelconque des revendications precedentes, avec une sequence d'acides amines incluant 
les ou constitute des rtsidus d'acides amints 1 03-1 1 2 a 226-227, 31-161 a 21 1 -227 ou 1 03-1 31 a 21 1 -227 de 
SEQ ID NO: 33. 

45 11. Polypeptide selon I'une quelconque des revendications 1, 3 et 6, dans lequel la partie de SEQ ID NO:33 inclut 8 
rtsidus cysttine qui sont conservts dans le facteur de croissance endothtlial vasculaire humain (VEGF), le facteur 
de croissance plaquettaire A (PDGF-A) et le facteur de croissance plaquettaire B (PDGF-B). 

12. Polypeptide selon I'une quelconque des revendications 1 a 7 avec une sequence d'acides amints comprenant les 
50 ou constitute des rtsidus d'acides amints 103-282, .103-227 ou 103-223 de SEQ ID NO: 33, ou d'un fragment 

desdites stquence qui se lie a Flt4. 

13. Polypeptide selon I'une quelconque des revendications 1 a 12 ayant un poids moleculaire apparent d'environ 21-23 
ou 29-32 kD tel qu'il est dttermint par EGPA-SDS dans des conditions rtductrices. 



55 



14. Polypeptide selon I'une quelconque des revendications prtctdentes, ledit polypeptide ttant capable de se lier au 
domaine extracellulaire du rtcepteur Flt4 de la tyrosine kinase (Flt4), ayant un poids moleculaire apparent d'ap- 
proximativement 23kD, tel qu'il est dttermint par EGPA-SDS dans des conditions rtductrices, et ttant purifiable 
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a partir de milieux conditionnes d'une lignee cellulaire d'adenocarcinome prostatique PC-3, ladite lignee cellulaire 
ayant un numero d'acces ATCC CRL 1435, en utilisant une methode de purification d'affinite dans laquelle la 
matrice de purification d'affinite comprend un polypeptide comprenant le domaine extracellulaire de Flt4. 

15. Polypeptide selon Tune quelconque des revendications precedentes, comprenant une sequence amino terminale 
d'acides amines comme indique dans SEQ ID NO: 13. 

16. Polypeptide selon Tune quelconque des revendications precedentes, capable de se lier a et d'induire I'autophos- 
phorylation de tyrosine du recepteur VEGFR-2. 

17. Polypeptide selon Tune quelconque des revendications 1 a 16, ou ledit polypeptide est capable de stimuler la 
phosphorylation de tyrosine de Flt4 dans une cellule note exprimant Flt4. 

18. Polypeptide selon Tune quelconque des revendications 1 a 16, ou ledit polypeptide est capable de se lier au 
domaine extracellulaire de Flt4. 

19. Polypeptide selon Tune quelconque des revendications 1 a 18, comprenant en outre un marqueur detectable. 

20. Proteine purifiee comprenant un premier polypeptide lie a un second polypeptide, dans laquelle au moins un parmi 
le dit premier polypeptide et le dit second polypeptide est un polypeptide conforme a Tune quelconque des reven- 
dications 1 a 19, et ou ladite proteine est capable de se lier au recepteur Flt4 de la tyrosine kinase (Flt4). 

21. Proteine selon la revendication 20, dans laquelle ledit premier polypeptide est relie de maniere covalente audit 
second polypeptide. 

22. Proteine comprenant un premier polypeptide lie a un second polypeptide, dans lequel chacun dudit premier poly- 
peptide et dudit second polypeptide est un polypeptide selon Tune quelconque des revendications 1 a 19. 

23. Composition pharmaceutique comprenant un polypeptide tel que revendique dans Tune quelconque des revendi- 
cations 1 a 19, ou une proteine selon Tune quelconque des revendications 20 a 22, dans un diluant, adjuvant, 
excipient ou vehicule pharmaceutiquement acceptable. 

24. Molecule d'acide nucleique isole comprenant une ou constitute d'une sequence nucleotidique codant un polypep- 
tide conforme a I'une quelconque des revendications 1 a 19, ou ladite molecule d'acide nucleique n'est pas une 
molecule d'ADNc avec la sequence d'ADN indiquee dans I'entree HS991157, numero d'acces H07991 .1 , dans la 
base de donnee EMBL. 

25. Molecule d'acide nucleique selon la revendication 24, avec une sequence nucleotidique comprenant une ou cons- 
titute d'une sequence contigue de nucleotides de la sequence nucleotidique 352-1608 de SEQ ID NO: 32. 

26. Molecule d'acide nucleique selon la revendication 24, comprenant un ou constitute d'un insert de plasmide 
pFLT4-L codant VEGF-C, depose sous le numero d'acces ATCC 97231. 

27. Vecteur comprenant une molecule d'acide nucleique selon I'une quelconque des revendications 24 a 26. 

28. Vecteur selon la revendication 27, avec un insert codant comprenant une ou constitute d'une molecule d'acide 
nucltique selon I'une quelconque des revendications 24 a 26. 

29. Vecteur selon la revendication 27, ledit vecteur ttant le plasmide pFLT-4L, dtpost sous le numtro d'acces ATCC 
97231. 

30. Cellule-hote transfectte ou transformte avec un vecteur selon I'une quelconque des revendications 27 a 29. 

31 . Mtthode de production d'un polypeptide ou d'une prottine capable de se lier sptcifiquement a Flt4, ladite mtthode 
comprenant les ttapes de: 

a) expression d'un polypeptide selon I'une quelconque des revendications 1 a 1 9 ou d'une prottine selon I'une 
quelconque des revendications 20 a 22, dans une cellule-hote, optionnellement selon la revendication 30, et 
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b) purification d'un polypeptide capable de se lier specifiquement a Flt4 de ladite cellule-hote ou d'un milieu 
de culture de ladite cellule-hote. 

32. Methode selon la revendication 31 , comprenant les etapes de: 

culture d'une cellule-hote comprenant un polynucleotide incluant une sequence nucleotidique qui code la se- 
quence d'acides amines indiquee dans SEQ ID NO: 33, dans des conditions dans lesquelles la cellule-hote 
exprime et secrete un polypeptide code par le polynucleotide; et 

isolement d'un polypeptide selon Tune quelconque des revendications 1 a 1 9, ou d'une proteine conforme a 
I'une quelconque des revendications 20 a 22, a partir du milieu de culture de la cellule-hote. 

33. Methode selon la revendication 32, dans laquelle le polypeptide isole a partir du milieu de culture de la cellule- 
hote a un poids moleculaire d'environ 23kD, tel qu'il est determine par EGPA-SDS dans des conditions reductrices, 
et une sequence d'acides amines comprenant une partie de SEQ ID NO: 33 efficace pour lier le domaine extra- 
cellulaire de Flt4. 

34. Methode selon la revendication 32 ou 33, dans laquelle ledit polynucleotide comprend un vecteur d'expression 
qui comprend une sequence nucleotidique qui code la sequence d'acides amines indiquee dans SEQ ID NO: 33. 

35. Methode selon la revendication 32, 33 ou 34, dans laquelle ladite cellule-hote comprend une cellule d'adenocar- 
cinome prostatique PC-3 (numero d'acces ATCC CRL 1435). 

36. Methode selon la revendication 32, dans laquelle ledit polynucleotide comprend I'insert codant du plasmide 
pFLT4-L depose sous le numero d'acces ATCC 97231 . 

37. Methode pour detecter les cellules endothelials dans un tissu biologique comprenant les etapes consistant a: 

exposer un tissu biologique comprenant des cellules endothelials a un polypeptide selon I'une quelconque 
des revendications 1 a 19, ou a une proteine selon I'une quelconque des revendications 20 a 22, dans des 
conditions dans lesquelles ledit polypeptide ou ladite proteine se lie aux cellules endothelials; et 
detecter ledit polypeptide ou ladite proteine lie(e) aux cellules endothelials dans ledit tissu biologique, de- 
tectant ainsi lesdites cellules endotheliales. 

38. Methode selon la revendication 37, comprenant en outre I'etape de lavage dudit tissu biologique, ladite etape de 
lavage etant effectuee apres ladite etape d'exposition et avant ladite etape de detection. 

39. Methode de modulation de la croissance des cellules endotheliales de mammifere comprenant les etapes consis- 
tant a: 

exposer les cellules endotheliales de mammifere a un polypeptide selon I'une quelconque des revendications 
1 a 19, ou a la proteine selon I'une quelconque des revendications 20 a 22, a une quantite efficace pour 
moduler la croissance des cellules endotheliales de mammifere. 

40. Anticorps qui est specifiquement reactif avec un polypeptide selon I'une quelconque des revendications 1 a 19. 

41. Anticorps selon la revendication 40 qui est un anticorps monoclonal. 

42. Anticorps selon la revendication 40 ou 41 , qui se lie specifiquement a un peptide avec une sequence d'acides 
amines constitute de la sequence d'acides amines indiquee dans SEQ ID NO: 39. 

43. Antiserum produit par un procede comprenant les etapes de: 

immunisation d'un mammifere avec un peptide avec une sequence d'acides amines constitute de la sequence 
d'acides amines indiquee dans SEQ ID NO: 39; et 

obtention de I'antiserum dudit mammifere apres ladite etape d'immunisation, ledit antiserum contenant les 
anticorps selon la revendication 40 ou 42. 

44. Antiserum selon la revendication 43, pour lequel le mammifere est immunise de multiples fois avant d'obtenir 
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I'antiserum dudit mammifere. 

45. Polypeptide selon Tune quelconque des revendications 1 a 19, ou proteine selon Tune quelconque des revendi- 
cations 20 a 22, ou composition pharmaceutique selon la revendication 23, pour utilisation dans une application 
diagnostique ou clinique. 

46. Polypeptide, proteine ou composition pharmaceutique selon la revendication 45, avec lequel ladite application 
diagnostique ou clinique implique ('acceleration ou I'induction de I'angiogenese, la quantification du risque metas- 
tatique futur, la promotion de la repousse ou de la permeabilite des vaisseaux lymphatiques, la reduction de la 
perte de vaisseaux lymphatiques auxiliaires consecutive aux interventions chirurgicales dans le traitement du 
cancer, le traitement ou la prevention de: inflammation, oedeme, aplasie des vaisseaux lymphatiques, obstruction 
lymphatique, elephantiasis et maladie de milroy, stimulation de la production et de la maturation des lymphocytes, 
promotion ou inhibition du trafic des leucocytes entre les tissus et les vaisseaux lymphatiques, ou migration a 
I'interieur et a I'exterieur du thymus, criblage pour un trouble de la cellule endothelial, facultativement un trouble 
des vaisseaux lymphatiques, modulation de la croissance des cellules endothelials chez un sujet mammifere, 
modulation des troubles associes a la cellule endothelial incluant la perte physique de vaisseaux lymphatiques, 
I'occlusion des vaisseaux lymphatiques et les lymphangiomes. 

47. Polypeptide, proteine ou composition pharmaceutique selon la revendication 45 ou 46, pour lequel le sujet de 
I'application diagnostique ou clinique est un humain. 

48. Utilisation d'un polypeptide selon Tune quelconque des revendications 1 a 19, d'une proteine selon Tune quelcon- 
que des revendications 20 a 22, ou d'une composition pharmaceutique selon la revendication 23 pour la fabrication 
d'un medicament pour I'utilisation dans une application diagnostique ou clinique telle que definie a la revendication 
46. 
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Figure 2 (1 of 2) 
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Figure 2 (2 of 2) 
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Figure 7 
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VEGF-C alignment 



1 50 
Hum HMLLGFFSVA CSLLAAALLP GPREAPAAAA AFESGLDLSD AEFDAGEATA 
Hou KFT.LCFL8LA C SLLAAALI P SPREAPATVA AFE6GLGFSE AEPDGGEVKA 
Qua MHLLEMLSLG CCLAAGAVLL GPRQPPVA.A AYESGHGYYE EEPGAG EPKA 





51 


Hum 


YASKDLBSQZ* 


Mou 


FEGKDLEEQL 


Qua 


HASJUDLEEQL 




101 


Hum 


SRTEETIKFA 


Mou 


TRTGDSVKFA 


Qua 


TRSDDSLKFA 




151 


Hum 


FFKPPCV8VY 


Mou 


FFKPPCVSVY 


Qua 


FFKPPCVSIY 




201 


Hum 


ISFAHBMCR 


Mou 


ISFAKHTSCR 


Qua 


VfiFANHTfiCR 



100 



ISO 



200 



250 



251 300 

Hum NHICRCLAQE DFMFSSDAGD DSTDGFHDIC GPNKELDEET CQCVCRAGLR 

Mou NYMCRCIAQQ DFIFYSNVED DSTNGFHDVC GPNKELDEDT CQCVCKGGLR 

Qua NQICRCLAQH DFGF56HLGD SDTSEGFHIC GPNKELDEET CQCVCKGGVR 
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